misc_feature
(1273)..(3091)
<223> 3' Untranslated Region



:3 > 



CDS 

(148)..(1272)



Met Arg Ile Leu Lys Arg Phe Leu Ala



tgc att cag ctc ctc tgt gtt tgc cgc ctg gat tgg gct aat gga tac 222

Cys Ile Gln Leu Leu Cys Val Cys Arg Leu Asp Trp Ala Asn Gly Tyr

10 15 20 25

tac aga caa cag aga aaa ctt gtt gaa gag att ggc tgg tcc tat aca 270

Tyr Arg Gln Gln Arg Lys Leu Val Glu Glu Ile Gly Trp Ser Tyr Thr
30 35 40

gga gca ctg aat caa aaa aat tgg gga aag aaa tat cca aca tgt aat 318

Gly Ala Leu Asn Gln Lys Asn Trp Gly Lys Lys Tyr Pro Thr Cys Asn
45 50 55

agc cca aaa caa tct cct atc aat att gat gaa gat ctt aca caa gta 366

Ser Pro Lys Gln Ser Pro Ile Asn Ile Asp Glu Asp Leu Thr Gln Val
60 65 70

aat gtg aat ctt aag aaa ctt aaa ttt cag ggt tgg gat aaa aca tca 414

Asn Val Asn Leu Lys Lys Leu Lys Phe Gln Gly Trp Asp Lys Thr Ser

75 80 85

ttg gaa aac aca ttc att cat aac act ggg aaa aca gtg gaa att aat 462

Leu Glu Asn Thr Phe Ile His Asn Thr Gly Lys Thr Val Glu Ile Asn

90 95 100 105 

ctc act aat gac tac cgt gtc agc gga gga gtt tca gaa atg gtg ttt 510

Leu Thr Asn Asp Tyr Arg Val Ser Gly Gly Val Ser Glu Met Val Phe
110 115 120

aaa gca agc aag ata act ttt cac tgg gga aaa tgc aat atg tca tct 558

Lys Ala Ser Lys Ile Thr Phe His Trp Gly Lys Cys Asn Met Ser Ser
125 130 135

gat gga tca gag cat agt tta gaa gga caa aaa ttt cca ctt gag atg 606

Asp Gly Ser Glu His Ser Leu Glu Gly Gln Lys Phe Pro Leu Glu Met
140 145 150

caa atc tac tgc ttt gat gcg gac cga ttt tca agt ttt gag gaa gca 654

Gln Ile Tyr Cys Phe Asp Ala Asp Arg Phe Ser Ser Phe Glu Glu Ala

155 160 165

gtc aaa gga aaa ggg aag tta aga gct tta tcc att ttg ttt gag gtt 702

Val Lys Gly Lys Gly Lys Leu Arg Ala Leu Ser Ile Leu Phe Glu Val

170 175 180 185

ggg aca gaa gaa aat ttg gat ttc aaa gcg att att gat gga gtc gaa 750 

Gly Thr Glu Glu Asn Leu Asp Phe Lys Ala Ile Ile Asp Gly Val Glu

190 195 200 



220 225 230 

tca ttg aca tct cct ccc tgc aca gac aca gtt gac tgg att gtt ttt 894

Ser Leu Thr Ser Pro Pro Cys Thr Asp Thr Val Asp Trp Ile Val Phe

235 240 245

aaa gat aca gtt agc atc tct gaa agc cag ttg gct gtt ttt tgt gaa 942

Lys Asp Thr Val Ser Ile Ser Glu Ser Gln Leu Ala Val Phe Cys Glu
250 255 260 265

gtt ctt aca atg caa caa tct ggt tat gtc atg ctg atg gac tac tta 990

Val Leu Thr Met Gln Gln Ser Gly Tyr Val Met Leu Met Asp Tyr Leu

270 275 280

caa aac aat ttt cga gag caa cag tac aag ttc tct aga cag gtg ttt 1038 

Gln Asn Asn Phe Arg Glu Gln Gln Tyr Lys Phe Ser Arg Gln Val Phe
285 290 295

tcc tca tac act gga aag gaa gag att cat gaa gca gtt tgt agt tca 1086

Ser Ser Tyr Thr Gly Lys Glu Glu Ile His Glu Ala Val Cys Ser Ser
300 305 310

gaa cca gaa aat gtt cag gct gac cca gag aat tat acc agc ctt ctt 1134

Glu Pro Glu Asn Val Gln Ala Asp Pro Glu Asn Tyr Thr Ser Leu Leu

315 320 325 

gtt aca tgg gaa aga cct cga gtc gtt tat gat acc atg att gag aag 1182 

Val Thr Trp Glu Arg Pro Arg Val Val Tyr Asp Thr Met Ile Glu Lys
330 335 340 345

ttt gca gtt ttg tac cag cag ttg gat gga gag gac caa acc aag cat 1230

Phe Ala Val Leu Tyr Gln Gln Leu Asp Gly Glu Asp Gln Thr Lys His
350 355 360

gaa ttt ttg aca gat ggc tat caa gac ttg gta act ata tga 1272

Glu Phe Leu Thr Asp Gly Tyr Gln Asp Leu Val Thr Ile
365 370 

tcagttgttt tacatagggt aacattataa tttaatttcc aaggtaagaa cttacaaatg 1332 

gttgtatatt attttcctc: attactttta gactttatgt gaaggtgggg taggctgagt 1392

ttttaaat ttaaaaaaaa at.t t taaat.t aga, age tat. a ctaaattatg tttaaagtta 1452 

catttaatta aatggatat: ataactttgc caacaataac actatagagt agata:atat 1512 

gacttatgaa ctggagatca tttagtgtgg cctttcttaa gatttcagtt gtagaatagt 1572 

gccagaatct cagtgccctg atacatttta tattgtgtct tccattacgc tatatcagca 1632



r:-: a anaotaaoan arataeaaat cct ctt tart acarcaaaaa attttcagat 1692 



ggtgtagttg acaaaccttc atctggatgt aggtccagaa agtccccact gcaggttaaa 1932 

ggacactgga ctctgcactc aggcacctag agtcctgcaa gtcctgggaa cctgcattta 1992 

aataaaaatg cactattaat tatgtttcat atcatgtgga caaaatggat aaaattttag 2052 

taacctttta attcagttgc ctggaatatg gagacacaat gacctgggaa aatcgtgaaa 2112 

taaatagtaa taaaaatgtt tatttcataa ttacgtgaag aagataattc tattactgtt 2172 

cttgcatata tattgtcaag aaaaagagat aacttagttg ttcacttttt cacattgctc 2232 

cttgtttgca aatgcccccc atttatttgt ctaaaatatt aatttttagt ttgtagtact 2292 

aatttatgaa tttgatgagt tctggctaaa aatgaaactt cctgaaacta aatctgattt 2352 

ttaaaaagca aaaaaaaaaa aaaagcctag ctttccagtt cttcataatt cacaaatacc 2412

acaagtttaa ctaagcaaca ttgcataaac ttttccttag gttaataaaa tagaagtatt 2472 

ttccacggac cagggagaaa aagttttcta ggaaagatac ctagtgtgtt ggtagtccta 2532 

tgagaataac atttgtataa ttactaacat ctttctttta gggtgctatt ctcaataatt 2592 

tgctacccaa tatgagttat gttcttcaga tagtagccat atgcactaat ggcttatatg 2652 

gaaaatacag cgaccaactg attgtcgaca tgcctactga taatcctggt aagtgccacc 2712 

agatacatct atatattaac tcaataaatg aggttagttt aattactgta tgcattgatg 2772 

ctttctctct atattctttt ggccaaaagg caaagtgatt ttctcttaag tctggattgc 2832 

cgggtaattt tttggggcat gggacccatt tctcattcag caggtctggt gccagacaat 2892 

aagtaaactt atccttaata ttggagttta ccatttgtaa aataagagtg actaaacata 2952 

tttataacat tgtaataatc attaaatgaa aattgctatg taaatgttga gactgttatt 3012 

ttggataatt aagagttggt ttaatttgta tttatttcct cttttcagcc cccaaagcat 3072

tatgtagtaa gtgtataca 3091



<210> 2 

<211> 374

<212> PRT

<213> Homo sapiens



Met Arg Ile Leu Lys Arg Phe Leu Ala Cys Ile Gln Leu Leu Cys Val
1 5 10 15



Cys Arg Leu Asp Trp Ala Asn Gly Tyr Tyr Arg Gln Gln Arg Lys Leu

20 25 30



Val Glu Glu Ile Gly Trp Ser Tyr Thr Gly Ala Leu Asn Gln Lys Asn
35 40 45



Trp Gly Lys Lys Tyr Pro Thr Cys Asn Ser Pro Lys Gln Ser Pro Ile
50 55 60



Asn Ile Asp Glu Asp Leu Thr Gln Val Asn Val Asn Leu Lys Lys Leu
65 70 75 80



Lys Phe Gln Gly Trp Asp Lys Thr Ser Leu Glu Asn Thr Phe Ile His
85 90 95



Asn Thr Gly Lys Thr Val Glu Ile Asn Leu Thr Asn Asp Tyr Arg Val

100 105 110



Ser Gly Gly Val Ser Glu Met Val Phe Lys Ala Ser Lys Ile Thr Phe
115 120 125



His Trp Gly Lys Cys Asn Met Ser Ser Asp Gly Ser Glu His Ser Leu 
130 135 140 



Glu Gly Gln Lys Phe Pro Leu Glu Met Gln Ile Tyr Cys Phe Asp Ala

145 150 155 160



Asp Arg Phe Ser Ser Phe Glu Glu Ala Val Lys Gly Lys Gly Lys Leu



Arg Ala Leu Ser Ile Leu Phe Glu Val Gly Thr Glu Glu Asn Leu Asp
180 185 190



Phe Lys Ala Ile Ile Asp Gly Val Glu Ser Val Ser Arg Phe Gly Lys

195 200 205



Thr Asp Lys Tyr Tyr Ile Tyr Asn Gly Ser Leu Thr Ser Pro Pro Cys
225 230 235 240



Thr Asp Thr Val Asp Trp Ile Val Phe Lys Asp Thr Val Ser Ile Ser
245 250 255



Glu Ser Gln Leu Ala Val Phe Cys Glu Val Leu Thr Met Gln Gln Ser
260 265 270 



Gly Tyr Val Met Leu Met Asp Tyr Leu Gln Asn Asn Phe Arg Glu Gln

275 280 285



Gln Tyr Lys Phe Ser Arg Gln Val Phe Ser Ser Tyr Thr Gly Lys Glu
290 295 300



Glu Ile His Glu Ala Val Cys Ser Ser Glu Pro Glu Asn Val Gln Ala
305 310 315 320



Asp Pro Glu Asn Tyr Thr Ser Leu Leu Val Thr Trp Glu Arg Pro Arg

325 330 335



Val Val Tyr Asp Thr Met Ile Glu Lys Phe Ala Val Leu Tyr Gln Gln
340 345 350



Leu Asp Gly Glu Asp Gln Thr Lys His Glu Phe Leu Thr Asp Gly Tyr
355 360 365



Gln Asp Leu Val Thr Ile

370



<210> 3
<211> 8058
<212> DNA
<213> Homo sapiens

<220>
<221> gene
<222> (1)..(8058)
<223> PTP-zeta SM2 exon 23a variant

<222> (7210)..(8058)
<223> 3' Untranslated Region



<400> 3

cacacatacg cacgcacgat ctcacttcga tctatacact ggaggattaa aacaaacaaa 60

caaaaaaaac atttccttcg ctccccctcc ctctccactc tgagaagcag aggagccgca 120

cggcgagggg ccgcagaccg tctggaaatg cgaatcctaa agcgtttcct cgcttgcatt 180

cagctcctct gtgtttgccg cctggattgg gctaatggat actacagaca acagagaaaa 240 

cttgttgaag agattggctg gtcctataca ggagcactga atcaaaaaaa ttggggaaag 300 

aaatatccaa catgtaatag cccaaaacaa tctcctatca atattgatga agatcttaca 360 

caagtaaatg tgaatcttaa gaaacttaaa tttcagggtt gggataaaac atcattggaa 420 

aacacattca ttcataacac tgggaaaaca gtggaaatta atctcactaa tgactaccgt 480 

gtcagcggag gagtttcaga aatggtgttt aaagcaagca agataacttt tcactgggga 540 

aaatgcaata tgtcatctga tggatcagag catagtttag aaggacaaaa atttccactt 600 

gagatgcaaa tctactgctt tgatgcggac cgattttcaa gttttgagga agcagtcaaa 660 

ggaaaaggga agttaagagc tttatccatt ttgtttgagg ttgggacaga agaaaatttg 720 

gatttcaaag cgattattga tggagtcgaa agtgttagtc gttttgggaa gcaggctgct 780

ttagatccat tcatactgtt gaaccttctg ccaaactcaa ctgacaagta ttacatttac 840 

aatggctcat tgacatctcc tccctgcaca gacacagttg actggattgt ttttaaagat 900 

acagttagca tctctgaaag ccagttggct gttttttgtg aagttcttac aatgcaacaa 960 

tctggttatg tcatgctgat ggactactta caaaacaatt ttcgagagca acagtacaag 1020 

ttctctagac aggtgttttc ctcatacact ggaaaggaag agattcatga agcagtttgt 1080

agttcagaac cagaaaatgt tcaggctgac ccagagaatt ataccagcct tcttgttaca 1140

tgggaaagac ctcgagtcgt ttatgacacc atgattgaga agtttgcagt tttgtaccag 1200

cagttggatg gagaggacca aaccaagcat gaatttttga cagatggcta tcaagacttg 1260

ggtgctattc tcaataattt gctacccaat atgagttatg ttcttcagat agtagccata 1320

tgcactaatg gcttatatgg aaaatacagc gaccaactga ttgtcgacat gcctactgat 1380



acgaaataca atgaagccaa gactaaccga tccccaacaa gaggaagtga attctctgga 1620

aagggtgatg ttcccaatac atctttaaat tccacttccc aaccagtcac taaattagcc 1680

acagaaaaag atatttcctt gacttctcag actgtgactg aactgccacc tcacactgtg 1740

gaaggtactt cagcctcttt aaatgatggc tctaaaactg ttcttagatc tccacatatg 1800

aacttgtcgg ggactgcaga atccttaaat acagtttcta taacagaata tgaggaggag 1860

agtttattga ccagtttcaa gcttgatact ggagctgaag attcttcagg ctccagtccc 1920

gcaacttctg ctatcccatt catctctgag aacatatccc aagggtatat attttcctcc 1980

gaaaacccag agacaataac atatgatgtc cttataccag aatctgctag aaatgcttcc 2040

gaagattcaa cttcatcagg ttcagaagaa tcactaaagg atccttctat ggagggaaat 2100 

gtgtggtttc ctagctctac agacataaca gcacagcccg atgttggatc aggcagagag 2160

agctttctcc agactaatta cactgagata cgtgttgatg aatctgagaa gacaaccaag 2220

tccttttctg caggcccagt gatgtcacag ggtccctcag ttacagatct ggaaatgcca 2280

cattattcta cctttgccta cttcccaact gaggtaacac ctcatgcttt taccccatcc 2340

tccagacaac aggatttggt ctccacggtc aacgtggtat actcgcagac aacccaaccg 2400

gtatacaatg gtgagacacc tcttcaacct tcctacagta gtgaagtctt tcctctagtc 2460

acccctttgt tgcttgacaa tcagatcctc aacactaccc ctgctgcttc aagtagtgat 2520

tcggccttgc atgctacgcc tgtatttccc agtgtcgatg tgtcatttga atccatcctg 2580

tcttcctatg atggtgcacc tttgcttcca ttttcctctg cttccttcag tagtgaattg 2640

tttcgccatc tgcatacagt ctctcaaatc cttccacaag ttacttcagc taccgagagt 2700

gataaggtgc ccttgcatgc ttctctgcca gtggctgggg gtgatttgct attagagccc 2760

■igcc: :.q':*t.-:: aq:.att :':qa t gtg r t gz cc actactcatg ctg>:t: :aga gacgctggaa 28 2 C 

: zaqz aqt q aa: ctggtgt :c: t at. aaa acg:tta:g: tt^^tcaagt. t.gaaccaccc 2 8 8 

ageagegatg :■ ::atgatg :a igca:gtic: teagggectg aacc 1 1 z t za tgccccgtct 2 94C 

gataatgagg gctcccaaca catcttcact gcttcttaca gttctgcaat acctgtgcat 3000

gattctgtgg gtgtaactta tcagggttcc ttatttagcg gccctagcca tataccaata 3060

aaa; r cqttaataac cccaactqca tcat.tact.gc agcctactca tqccctctct 3120 



actgaactgc aaattccttc tttcaatgag atggtttacc cttctgaaag cacagtcatg 3360

cccaacatgt atgataatgt aaataagttg aatgcgtctr tacaagaaac ctctgtttcc 3420 

acttctagca ccaagggcat gtttccaggg tcccttgctc ataccaccac taaggttttt 3480

gatcatgaga ttagtcaagt tccagaaaat aacttttcag ttcaacctac acatactgtc 3540

tctcaagcat ctggtgacac ttcgcttaaa cctgtgctta gtgcaaactc agagccagca 3600

tcctctgacc ctgcttctag tgaaatgtta tctccttcaa ctcagctctt attttatgag 3660

acctcagctt cttttagtac tgaagtattg ctacaacctt cctttcaggc ttctgatgtt 3720

gacaccttgc ttaaaactgt tcttccagct gtgcccagtg atccaatatt ggttgaaacc 3780 

cccaaagttg ataaaattag ttctacaatg ttgcatctca ttgtatcaaa ttctgcttca 3840 

agtgaaaaca tgctgcactc tacatctgta ccagtttttg atgtgtcgcc tacttctcat 3900 

atgcactctg cttcacttca aggtttgacc atttcctatg caagtgagaa atatgaacca 3960 

gttttgttaa aaagtgaaag ttcccaccaa gtggtacctt ctttgtacag taatgatgag 4020

ttgttccaaa cggccaattt ggagattaac caggcccatc ccccaaaagg aaggcatgta 4080

tttgctacac ctgttttatc aattgatgaa ccattaaata cactaataaa taagcttata 4140

cattccgatg aaattttaac ctccaccaaa agttctgtta ctggcaaggt atttgctggt 4200

attccaacag ttgcttctga tacatttgta tctactgatc attctgttcc tataggaaat 4260

gggcatgttg ccattacagc tgtttctccc cacagagatg gttctgtaac ctcaacaaag 4320

ttgctgtttc cttctaaggc aacttctgag ctgagtcata gtgccaaatc tgatgccggt 4380

ttagtgggtg gtggtgaaga tggtgacact gatgatgatg gtgatgatga tgatgacaga 4440

gatagtgatg gcttatccat tcataagtgt atgtcatgct catcctatag agaatcacag 4500

gaaaaggtaa tgaatgattc agacacccac gaaaacagtc ttatggatca gaataatcca 4560

■: va:. art cacta::tga gaat tctgaa gaagataata gagtcacaag igtatcc:.:a 4 (: 1 C 

gacagtcaaa ctggtatgga cagaagtcct ggtaaatcac catcagcaaa tgggctatcc 4680

caaaagcaca atgatggaaa agaggaaaat gacattcaga ctggtagtgc tctgcttcct 4740

'-"'t cage rctg aatctaaagc at gggcagt t ct.aacaagtg at.gaagaaag tggatcaggg 4 800 



gaggcagagg ccagtaatag tagccatgag tctcgtattg gtctagctga ggggttggaa 5040 

tccgagaaga aggcagttat accccttgtg ategtgtcag ccctgacttt tatctgtcta 5100 

gtggttcttg tgggtattct catctactgg aggaaatgct tccagactgc acacttttac 5160

ttagaggaca gtacatcccc tagagttata tccacacctc caacacctat ctttccaatt 5220

tcagatgatg tcggagcaat tccaataaag cactttccaa agcatgttgc agatttacat 5280

gcaagtagtg ggtttactga agaatttgag acactgaaag agttttacca ggaagtgcag 5340 

agctgtactg ttgacttagg tattacagca gacagctcca accacccaga caacaagcac 5400 

aagaatcgat acataaatat cgttgcctat gatcatagca gggttaagct agcacagctt 5460

gctgaaaagg atggcaaact gactgattat atcaatgcca attatgttga tggctacaac 5520

agaccaaaag cttatattgc tgcccaaggc ccactgaaat ccacagctga agatttctgg 5580

agaatgatat gggaacataa tgtggaagtt attgtcatga taacaaacct cgtggagaaa 5640

ggaaggagaa aatgtgatca gtactggcct gccgatggga gtgaggagta cgggaacttt 5700 

ctggtcactc agaagagtgt gcaagtgctt gcctattata ctgtgaggaa ttttactcta 5760 

agaaacacaa aaataaaaaa gggctcccag aaaggaagac ccagtggacg tgtggtcaca 5820 

cagtatcact acacgcagtg gcctgacatg ggagtaccag agtactccct gccagtgctg 5880 

acctttgtga gaaaggcagc ctatgccaag cgccatgcag tggggcctgt tgtcgtccac 5940

tgcagtgctg gagttggaag aacaggcaca tatattgtgc tagacagtat gttgcagcag 6000

attcaacacg aaggaactgt caacatattt ggcttcttaa aacacatccg ttcacaaaga 6060

aattatttgg cacaaactga ggagcaatat gtcttcattc atgatacact ggttgaggcc 6120 

atacttagta aagaaactga ggtgctggac agtcatattc atgcctatgt taatgcactc 6180 

c r c a t. t. v c t g g a :* cage a gg r a a a a :: a a a g c. t .a g .a g a aac a a t c caggg tctca c t. c t g 62 4 D 

t eacecaggc tggagtgcag agqca:aat : t.cg g: - v cact gcaa cc t z cc tctccctgg: 6 i 0 " 

L^aacLgaic t a re I. .• agect :ccga gtggctggga clatactcct gage cag tea 6 3 6 0 

aatatacagc agagtgacta ttctgcagcc ctaaagcaat gcaacaggga aaagaatcga 6420

acttcttcta tcatccctgt ggaaagatca agggttggca tctcatccct gagtggagaa 6480

ggcacagact acatcaatgc ctcctatatc atgggctatt accagagcaa tgaattcatc 6540



gaagaacaca aatgtctatc taatgaggaa aaacttataa ttcaggactt tatcttagaa 6780 

gctacacagg atgattatgt acttgaagtg aggcactttc agtgtcctaa atggccaaat 6840

ccagatagcc ccattagtaa aacttttgaa cttataagtg ttataaaaga agaagctgcc 6900

aatagggatg ggcctatgat tgttcatgat gagcatggag gagtgacggc aggaactttc 6960

tgtgctctga caacccttat gcaccaacta gaaaaagaaa attccgtgga tgtttaccag 7020

gtagccaaga tgatcaatct gatgaggcca ggagtctttg ctgacattga gcagtatcag 7080

tttctctaca aagtgatcct cagccttgtg agcacaaggc aggaagagaa tccatccacc 7140 

tctctggaca gtaatggtgc agcattgcct gatggaaata tagctgagag cttagagtct 7200 

ttagtttaac acagaaaggg gtggggggac tcacatctga gcattgtttt cctcttccta 7260 

aaattaggca ggaaaatcag tctagttctg ttatctgttg atttcccatc acctgacagt 7320 

aactttcatg acataggatt ctgccgccaa atttatatca ttaacaatgt gtgccttttt 7380

gcaagacttg taatttactt attatgtttg aactaaaatg attgaatttt acagtatttc 7440 

taagaatgga attgtggtat ttttttctgt attgatttta acagaaaatt tcaatttata 7500 

gaggttagga attccaaact acagaaaatg tttgttttta gtgtcaaatt tttagctgta 7560 

tttgtagcaa ttatcaggtt tgctagaaat ataactttta atacagtagc ctgtaaataa 7620 

aacactcttc catatgatat tcaacatttt acaactgcag tattcaccta aagtagaaat 7680 

aatctgttac ttattgtaaa tactgcccta gtgtctccat ggaccaaatt tatatttata 7740 

attgtagatt tttatatttt actactgagt caagttttct agttctgtgt aattgtttag 7800 

tttaatgacg tagttcatta gctggtctta ctctaccagt tttctgacat tgtattgtgt 7860 

tacctaagtc attaactttg tttcagcatg taattttaac tt^tgtggaa aaragaaata 7920 

cct tear 1. 1 1 gaaagaagtt tttatgagaa taacacctta ccaaacattg t tcaaatggt 793C 

:ttLatccaa ggaattgcaa aaataaatat. aaatattgee attaaaaaaa aaaaaaaaaa 804 C 

aaaaaaaaaa aaaaaaaa 8058 



PRT 



<223> PTP-zeta SM2 23a exon variant



<400> 4

Met Arg Ile Leu Lys Arg Phe Leu Ala Cys Ile Gln Leu Leu Cys Val
1 5 10 15



Cys Arg Leu Asp Trp Ala Asn Gly Tyr Tyr Arg Gln Gln Arg Lys Leu
20 25 30



Val Glu Glu Ile Gly Trp Ser Tyr Thr Gly Ala Leu Asn Gln Lys Asn
35 40 45



Trp Gly Lys Lys Tyr Pro Thr Cys Asn Ser Pro Lys Gln Ser Pro Ile
50 55 60



Asn Ile Asp Glu Asp Leu Thr Gln Val Asn Val Asn Leu Lys Lys Leu
65 70 75 80



Lys Phe Gln Gly Trp Asp Lys Thr Ser Leu Glu Asn Thr Phe Ile His
85 90 95



Asn Thr Gly Lys Thr Val Glu Ile Asn Leu Thr Asn Asp Tyr Arg Val
100 105 110



Ser Gly Gly Val Ser Glu Met Val Phe Lys Ala Ser Lys Ile Thr Phe
115 120 125



His Trp Gly Lys Cys Asn Met Ser Ser Asp Gly Ser Glu His Ser Leu
130 135 140



Glu Gly Gln Lys Phe Pro Leu Glu Met Gln Ile Tyr Cys Phe Asp Ala

145 150 155 160



Asp Arg Phe Ser Ser Phe Glu Glu Ala Val Lys Gly Lys Gly Lys Leu
165 170 175



Arg Ala Leu Ser Ile Leu Phe Glu Val Gly Thr Glu Glu Asn Leu Asp
180 185 190 



Gln Ala Ala Leu Asp Pro Phe Ile Leu Leu Asn Leu Leu Pro Asn Ser
210 215 220



Thr Asp Lys Tyr Tyr Ile Tyr Asn Gly Ser Leu Thr Ser Pro Pro Cys
225 230 235 240



Thr Asp Thr Val Asp Trp Ile Val Phe Lys Asp Thr Val Ser Ile Ser
245 250 255



Glu Ser Gln Leu Ala Val Phe Cys Glu Val Leu Thr Met Gln Gln Ser
260 265 270



Gly Tyr Val Met Leu Met Asp Tyr Leu Gln Asn Asn Phe Arg Glu Gln
275 280 285



Gln Tyr Lys Phe Ser Arg Gln Val Phe Ser Ser Tyr Thr Gly Lys Glu
290 295 300



Glu Ile His Glu Ala Val Cys Ser Ser Glu Pro Glu Asn Val Gln Ala
305 310 315 320



Asp Pro Glu Asn Tyr Thr Ser Leu Leu Val Thr Trp Glu Arg Pro Arg
325 330 335



Val Val Tyr Asp Thr Met Ile Glu Lys Phe Ala Val Leu Tyr Gln Gln
340 345 350



Leu Asp Gly Glu Asp Gln Thr Lys His Glu Phe Leu Thr Asp Gly Tyr
355 360 365



Gln Asp Leu Gly Ala Ile Leu Asn Asn Leu Leu Pro Asn Met Ser Tyr

370 375 380



Val Leu Gln Ile Val Ala Ile Cys Thr Asn Gly Leu Tyr Gly Lys Tyr
385 390 395 400



Ser Asp Gln Leu Ile Val Asp Met Pro Thr Asp Asn Pro Glu Leu Asp

405 410 415



435 440 445 



Ser Ala Thr Asn Gin He Arg Lys Lys Glu Pro Gin He Ser Thr Thr 
450 455 460 



Thr His Tyr Asn Arg He Gly Thr Lys Tyr Asn Glu Ala Lys Thr Asn 
465 470 475 480 



Arg Ser Pro Thr Arg Gly Ser Glu Phe Ser Gly Lys Gly Asp Val Pro 
485 490 495 



Asn Thr Ser Leu Asn Ser Thr Ser Gin Pro Val Thr Lys Leu Ala Thr 
500 505 510 



Glu Lys Asp He Ser Leu Thr Ser Gin Thr Val Thr Glu Leu Pro Pro 
515 520 525 



His Thr Val Glu Gly Thr Ser Ala Ser Leu Asn Asp Gly Ser Lys Thr 
530 535 540 



Val Leu Arg Ser Pro His Met Asn Leu Ser Gly Thr Ala Glu Ser Leu 
545 550 555 560 



Asn Thr Val Ser He Thr Glu Tyr Glu Glu Glu Ser Leu Leu Thr Ser 
565 570 575 



Phe Lys Leu Asp Thr Gly Ala Glu Asp Ser Ser Gly Ser Ser Pro Ala 

580 585 590 



Thr Ser Ala Ile Pro Phe Ile Ser Glu Asn Ile Ser Gln Gly Tyr Ile

595 600 605



Phe Ser Ser Glu Asn Pro Glu Thr Ile Thr Tyr Asp Val Leu Ile Pro

610 615 620



Glu Ser Ala Arg Asn Ala Ser Glu Asp Ser Thr Ser Ser Gly Ser Glu 

625 630 635 640 



Phe Leu Gin Thr Asn Tyr Thr Glu lie Arg Val Asp Glu Ser Glu Lys 
675 680 685 



Thr Thr Lys Ser Phe Ser Ala Gly Pro Val Met Ser Gin Gly Pro Ser 

690 695 700 



Val Thr Asp Leu Glu Met Pro His Tyr Ser Thr Phe Ala Tyr Phe Pro 
705 710 715 720 



Thr Glu Val Thr Pro His Ala Phe Thr Pro Ser Ser Arg Gin Gin Asp 
725 730 735 



Leu Val Ser Thr Val Asn Val Val Tyr Ser Gln Thr Thr Gln Pro Val
740 745 750 



Tyr Asn Gly Glu Thr Pro Leu Gin Pro Ser Tyr Ser Ser Glu Val Phe 
755 760 765 



Pro Leu Val Thr Pro Leu Leu Leu Asp Asn Gin lie Leu Asn Thr Thr 
770 775 780 



Pro Ala Ala Ser Ser Ser Asp Ser Ala Leu His Ala Thr Pro Val Phe 

78 5 790 795 800 



Pro Ser Val Asp Val Ser Phe Glu Ser lie Leu Ser Ser Tyr Asp Gly 
805 810 815 



Ala Pro Leu Leu Pro Phe Ser Ser Ala Ser Phe Ser Ser Glu Leu Phe 

820 825 830 



Arg His Leu His Thr Val Ser Gln Ile Leu Pro Gln Val Thr Ser Ala
835 840 845



Thr Glu Ser Asp Lys Val Pro Leu His Ala Ser Leu Pro Val Ala Gly 
850 85 5 860 



Gly Asp Leu Leu Leu Glu Pro Ser Leu Ala Gin Tyr Ser Asp Val Leu 



Gly Val Leu Tyr Lys Thr Leu Met Phe Ser Gin Val Glu Pro Pro Ser 

900 905 910 



Ser Asp Ala Met Met His Ala Arg Ser Ser Gly Pro Glu Pro Ser Tyr 

915 920 925 



Ala Leu Ser Asp Asn Glu Gly Ser Gin His lie Phe Thr Val Ser Tyr 
930 935 940 



Ser Ser Ala lie Pro Val His Asp Ser Val Gly Val Thr Tyr Gin Gly 
945 950 955 960 



Ser Leu Phe Ser Gly Pro Ser His lie Pro lie Pro Lys Ser Ser Leu 

965 970 975 



lie Thr Pro Thr Ala Ser Leu Leu Gin Pro Thr His Ala Leu Ser Gly 
980 985 990 



Asp Gly Glu Trp Ser Gly Ala Ser Ser Asp Ser Glu Phe Leu Leu Pro 
995 1000 1005 



Asp Thr Asp Gly Leu Thr Ala Leu Asn lie Ser Ser Pro Val Ser 
1010 1015 1020 



Val Ala Glu Phe Thr Tyr Thr Thr Ser Val Phe Gly Asp Asp Asn 
1025 1030 1035 



Lys Ala Leu Ser Lys Ser Glu lie lie Tyr Gly Asn Glu Thr Glu 
1040 1045 1050 



Leu Gln Ile Pro Ser Phe Asn Glu Met Val Tyr Pro Ser Glu Ser
1055 1060 1065



Thr Val Met Pro Asn Met Tyr Asp Asn Val Asn Lys Leu Asn Ala
1070 1075 1080



Ser Leu Gln Glu Thr Ser Val Ser Ile Ser Ser Thr Lys Gly Met
1085 1090 1095



Glu Ile Ser Gln Val Pro Glu Asn Asn Phe Ser Val Gln Pro Thr
1115 1120 1125 



His Thr Val Ser Gin Ala Ser Gly Asp Thr Ser Leu Lys Pro Val 
113 0 113 5 114 0 



Leu Ser Ala Asn Ser Glu Pro Ala Ser Ser Asp Pro Ala Ser Ser 
1145 1150 1155 



Glu Met Leu Ser Pro Ser Thr Gin Leu Leu Phe Tyr Glu Thr Ser 
1160 1165 1170 



Ala Ser Phe Ser Thr Glu Val Leu Leu Gin Pro Ser Phe Gin Ala 
1175 1180 1185 



Ser Asp Val Asp Thr Leu Leu Lys Thr Val Leu Pro Ala Val Pro 
1190 1195 1200 



Ser Asp Pro He Leu Val Glu Thr Pro Lys Val Asp Lys He Ser 
1205 1210 1215 



Ser Thr Met Leu His Leu He Val Ser Asn Ser Ala Ser Ser Glu 
1220 1225 1230 



Asn Met Leu His Ser Thr Ser Val Pro Val Phe Asp Val Ser Pro 
1235 1240 1245 



Thr Ser His Met His Ser Ala Ser Leu Gin Gly Leu Thr He Ser 
1250 1255 1260 



Tyr Ala Ser Glu Lys Tyr Glu Pro Val Leu Leu Lys Ser Glu Ser
1265 1270 1275



Ser His Gln Val Val Pro Ser Leu Tyr Ser Asn Asp Glu Leu Phe
1280 1285 1290



Gln Thr Ala Asn Leu Glu Ile Asn Gln Ala His Pro Pro Lys Gly

1295 1300 1305



# 



1325 1330 1335 



Ser Thr Lys Ser Ser Val Thr Gly Lys Val Phe Ala Gly lie Pro 
1340 1345 1350 



Thr Val Ala Ser Asp Thr Phe Val Ser Thr Asp His Ser Val Pro 
1355 1360 1365 



He Gly Asn Gly His Val Ala He Thr Ala Val Ser Pro His Arg 
1370 1375 1380 



Asp Gly Ser Val Thr Ser Thr Lys Leu Leu Phe Pro Ser Lys Ala 
1385 1390 1395 



Thr Ser Glu Leu Ser His Ser Ala Lys Ser Asp Ala Gly Leu Val 
1400 1405 1410 



Gly Gly Gly Glu Asp Gly Asp Thr Asp Asp Asp Gly Asp Asp Asp 
1415 1420 1425 



Asp Asp Arg Asp Ser Asp Gly Leu Ser He His Lys Cys Met Ser 
1430 1435 1440 



Cys Ser Ser Tyr Arg Glu Ser Gin Glu Lys Val Met Asn Asp Ser 
1445 1450 1455 



Asp Thr His Glu Asn Ser Leu Met Asp Gin Asn Asn Pro He Ser 
1460 1465 1470 



Tyr Ser Leu Ser Glu Asn Ser Glu Glu Asp Asn Arg Val Thr Ser 

1475 1480 1485



Val Ser Ser Asp Ser Gln Thr Gly Met Asp Arg Ser Pro Gly Lys
1490 1495 1500



Ser Pro Ser Ala Asn Gly Leu Ser Gln Lys His Asn Asp Gly Lys

1505 1510 1515



Gly Ser Gly Gin Gly Thr Ser Asp Ser Leu Asn Glu Asn Glu Thr 
1550 1555 1560 



Ser Thr Asp Phe Ser Phe Ala Asp Thr Asn Glu Lys Asp Ala Asp 
1565 1570 1575 



Gly He Leu Ala Ala Gly Asp Ser Glu He Thr Pro Gly Phe Pro 
1580 1585 1590 



Gin Ser Pro Thr Ser Ser Val Thr Ser Glu Asn Ser Glu Val Phe 
1595 1600 1605 



His Val Ser Glu Ala Glu Ala Ser Asn Ser Ser His Glu Ser Arg 
1610 1615 1620 



He Gly Leu Ala Glu Gly Leu Glu Ser Glu Lys Lys Ala Val He 
1625 1630 1635 



Pro Leu Val lie Val Ser Ala Leu Thr Phe He Cys Leu Val Val 
1640 1645 1650 



Leu Val Gly He Leu He Tyr Trp Arg Lys Cys Phe Gin Thr Ala 
1655 1660 1665 



His Phe Tyr Leu Glu Asp Ser Thr Ser Pro Arg Val He Ser Thr 
1670 1675 1680 



Pro Pro Thr Pro Ile Phe Pro Ile Ser Asp Asp Val Gly Ala Ile

1685 1690 1695



Pro Ile Lys His Phe Pro Lys His Val Ala Asp Leu His Ala Ser
1700 1705 1710



Ser Gly Phe Thr Glu Glu Phe Glu Thr Leu Lys Glu Phe Tyr Gin 
1715 1720 1725 



AH Asp Sei 



Val Ala Tyr Asp His Ser Arg Val Lys Leu Ala Gin Leu Ala Glu 
1760 1765 1770



Lys Asp Gly Lys Leu Thr Asp Tyr Ile Asn Ala Asn Tyr Val Asp
1775 1780 1785



Gly Tyr Asn Arg Pro Lys Ala Tyr Ile Ala Ala Gln Gly Pro Leu
1790 1795 1800



Lys Ser Thr Ala Glu Asp Phe Trp Arg Met lie Trp Glu His Asn 
1805 1810 1815 



Val Glu Val He Val Met He Thr Asn Leu Val Glu Lys Gly Arg 
1820 1825 1830 



Arg Lys Cys Asp Gin Tyr Trp Pro Ala Asp Gly Ser Glu Glu Tyr 
1835 1840 1845 



Gly Asn Phe Leu Val Thr Gln Lys Ser Val Gln Val Leu Ala Tyr
1850 1855 1860



Tyr Thr Val Arg Asn Phe Thr Leu Arg Asn Thr Lys Ile Lys Lys
1865 1870 1875



Gly Ser Gln Lys Gly Arg Pro Ser Gly Arg Val Val Thr Gln Tyr
1880 1885 1890 



His Tyr Thr Gin Trp Pro Asp Met Gly Val Pro Glu Tyr Ser Leu 
1895 1900 1905 



Pro Val Leu Thr Phe Val Arg Lys Ala Ala Tyr Ala Lys Arg His 

1910 1915 1920



Ala Val Gly Pro Val Val Val His Cys Ser Ala Gly Val Gly Arg
1925 1930 1935



Thr Gly Thr Tyr Ile Val Leu Asp Ser Met Leu Gln Gln Ile Gln
1940 1945 1950



# 



Ser Gin Arg Asn Tyr Leu Val Gin Thr Glu Glu Gin Tyr Val Phe 
1970 1975 1980 



lie His Asp Thr Leu Val Glu Ala lie Leu Ser Lys Glu Thr Glu 
19 85 .990 1995 



Val Leu Asp Ser His lie His Ala Tyr Val Asn Ala Leu Leu lie 

2 0 0 0 2 0 0 5 2 0 1 0 



Pro Gly Pro Ala Gly Lys Thr Lys Leu Glu Lys Gin Phe Gin Gly 
2015 2020 2025 



Leu Thr Leu Ser Pro Arg Leu Glu Cys Arg Gly Thr lie Ser Ala 
203 0 2 035 2 04 0 



His Cys Asn Leu Pro Leu Pro Gly Leu Thr Asp Pro Pro Thr Ser 
2045 2050 2055 



Ala Ser Arg Val Ala Gly Thr lie Leu Leu Ser Gin Ser Asn lie 

2 0 6 0 2 0 6 5 2 0 7 0 



Gin Gin Ser Asp Tyr Ser Ala Ala Leu Lys Gin Cys Asn Arg Glu 
2 0 75 2080 2 0 85 



Lys Asn Arg Thr Ser Ser He He Pro Val Glu Arg Ser Arg Val 
2090 2095 2100 



Gly Ile Ser Ser Leu Ser Gly Glu Gly Thr Asp Tyr Ile Asn Ala
2105 2110 2115 



Met Gly Tvr Tyr Gly Ser Asn Glu Phe He He Thr 
2 12 5 2 1 2 C 



;ii: His Pro Leu Leu His Chi He Lys Asp Phe Trp Arg Met He 
2135 214C 2145 



Trp Asp His Asn Ala Gin Leu Val Val Met He Pro Asp Gly Gin 



!180 2185 2190 



His Lys Cys Leu Ser Asn Glu Glu Lys Leu lie lie Gin Asp Phe 
2 19 5 2 2 0 0 2 2 0 5 



lie Leu Glu Ala Thr Gin Asp Asp Tyr Val Leu Glu Val Arg His 
2210 2215 2220 



Phe Gin Cys Pro Lys Trp Pro Asn Pro Asp Ser Pro lie Ser Lys 
2225 2230 22 3 5 



Thr Phe Glu Leu lie Ser Val lie Lys Glu Glu Ala Ala Asn Arg 
2240 2245 2250 



Asp Gly Pro Met lie Val His Asp Glu His Gly Gly Val Thr Ala 
2255 2260 2265 



Gly Thr Phe Cys Ala Leu Thr Thr Leu Met His Gin Leu Glu Lys 
2270 2275 2280 



Glu Asn Ser Val Asp Val Tyr Gin Val Ala Lys Met lie Asn Leu 
2 285 2290 2 295 



Met Arg Pro Gly Val Phe Ala Asp lie Glu Gin Tyr Gin Phe Leu 
2 3 0 0 2 3 0 5 2 310 



Tyr Lys Val lie Leu Ser Leu Val Ser Thr Arg Gin Glu Glu Asn 
2315 2320 2325 



Pro Ser Thr Ser Leu Asp Ser Asn Gly Ala Ala Leu Pro Asp Gly 



Asn Ile Ala Glu Ser Leu Glu Ser Leu Val
2345 2350



<210> 5

<211> 7941

<212> DNA

<213> Homo sapiens



gene

(1)..(7941)
PTP-zeta



<400> 5

cacacatacg cacgcacgat ctcacttcga tctatacact ggaggattaa aacaaacaaa 60 

caaaaaaaac atttccttcg ctccccctcc ctctccactc tgagaagcag aggagccgca 120

cggcgagggg ccgcagaccg tctggaa atg cga atc cta aag cgt ttc ctc gct 174

Met Arg Ile Leu Lys Arg Phe Leu Ala



tgc att cag ctc ctc tgt gtt tgc cgc ctg gat tgg gct aat gga tac 222
Cys Ile Gln Leu Leu Cys Val Cys Arg Leu Asp Trp Ala Asn Gly Tyr
10 15 20 25



tac aga caa cag aga aaa ctt gtt gaa gag att ggc tgg tcc tat aca 270
Tyr Arg Gln Gln Arg Lys Leu Val Glu Glu Ile Gly Trp Ser Tyr Thr
30 35 40



gga gca ctg aat caa aaa aat tgg gga aag aaa tat cca aca tgt aat 318
Gly Ala Leu Asn Gln Lys Asn Trp Gly Lys Lys Tyr Pro Thr Cys Asn
45 50 55



agc cca aaa caa tct cct atc aat att gat gaa gat ctt aca caa gta 366
Ser Pro Lys Gln Ser Pro Ile Asn Ile Asp Glu Asp Leu Thr Gln Val
60 65 70



aat gtg aat ctt aag aaa ctt aaa ttt cag ggt tgg gat aaa aca tca 414
Asn Val Asn Leu Lys Lys Leu Lys Phe Gln Gly Trp Asp Lys Thr Ser
75 80 85



ttg gaa aac aca ttc att cat aac act ggg aaa aca gtg gaa att aat 462
Leu Glu Asn Thr Phe Ile His Asn Thr Gly Lys Thr Val Glu Ile Asn
90 95 100 105



ctc act aat gac tac cgt gtc agc gga gga gtt tca gaa atg gtg ttt
Leu Thr Asn Asp Tyr Arg Val Ser Gly Gly Val Ser Glu Met Val Phe



aaa gca agc aag ata act ttt cac tgg gga aaa tgc aat atg tca tct
Lys Ala Ser Lys Ile Thr Phe His Trp Gly Lys Cys Asn Met Ser Ser
125 130 135



gat gga tca gag cat agt tta gaa gga caa aaa ttt cca ctt gag atg 606
Asp Gly Ser Glu His Ser Leu Glu Gly Gln Lys Phe Pro Leu Glu Met
140 145 150



Val Lys Gly Lys Gly Lys Leu Arg Ala Leu Ser Ile Leu Phe Glu Val

170 175 180 185

ggg aca gaa gaa aat ttg gat ttc aaa gcg att att gat gga gtc gaa 750

Gly Thr Glu Glu Asn Leu Asp Phe Lys Ala Ile Ile Asp Gly Val Glu

190 195 200

agt gtt agt cgt ttt ggg aag cag gct gct tta gat cca ttc ata ctg 798

Ser Val Ser Arg Phe Gly Lys Gln Ala Ala Leu Asp Pro Phe Ile Leu

205 210 215

ttg aac ctt ctg cca aac tca act gac aag tat tac att tac aat ggc 846

Leu Asn Leu Leu Pro Asn Ser Thr Asp Lys Tyr Tyr Ile Tyr Asn Gly

220 225 230

tca ttg aca tct cct ccc tgc aca gac aca gtt gac tgg att gtt ttt 894

Ser Leu Thr Ser Pro Pro Cys Thr Asp Thr Val Asp Trp Ile Val Phe

235 240 245

aaa gat aca gtt agc atc tct gaa agc cag ttg gct gtt ttt tgt gaa 942

Lys Asp Thr Val Ser Ile Ser Glu Ser Gln Leu Ala Val Phe Cys Glu

250 255 260 265

gtt ctt aca atg caa caa tct ggt tat gtc atg ctg atg gac tac tta 990

Val Leu Thr Met Gln Gln Ser Gly Tyr Val Met Leu Met Asp Tyr Leu

270 275 280

caa aac aat ttt cga gag caa cag tac aag ttc tct aga cag gtg ttt 1038 

Gln Asn Asn Phe Arg Glu Gln Gln Tyr Lys Phe Ser Arg Gln Val Phe

285 290 295

tcc tca tac act gga aag gaa gag att cat gaa gca gtt tgt agt tca 1086

Ser Ser Tyr Thr Gly Lys Glu Glu Ile His Glu Ala Val Cys Ser Ser

300 305 310

gaa cca gaa aat gtt cag gct gac cca gag aat tat acc agc ctt ctt 1134

Glu Pro Glu Asn Val Gln Ala Asp Pro Glu Asn Tyr Thr Ser Leu Leu

315 320 325

gtt aca tgg gaa aga cct cga gtc gtt tat gat acc atg att gag aag 1182

Val Thr Trp Glu Arg Pro Arg Val Val Tyr Asp Thr Met Ile Glu Lys

330 335 340 345

ttt gca gtt ttg tac cag cag ttg gat gga gag gac caa acc aag cat 1230

Phe Ala Val Leu Tyr Gln Gln Leu Asp Gly Glu Asp Gln Thr Lys His

350 355 360

gaa ttt ttg aca gat ggc tat caa gac ttg ggt gct att ctc aat aat 1278

Glu Phe Leu Thr Asp Gly Tyr Gln Asp Leu Gly Ala Ile Leu Asn Asn

365 370 375



395 400 405 

act gat aat cct gaa ctt gat ctt ttc cct gaa tta att gga act gaa 1422 
Thr Asp Asn Pro Glu Leu Asp Leu Phe Pro Glu Leu lie Gly Thr Glu 
410 415 42 0 425 

gaa ata ate aag gag gag gaa gag gga aaa gac att gaa gaa ggc get 1470 
Glu lie lie Lys Glu Glu Glu Glu Gly Lys Asp lie Glu Glu Gly Ala 
430 435 440 

att gtg aat cct ggt aga gac agt get aca aac caa ate agg aaa aag 1518 
lie Val Asn Pro Gly Arg Asp Ser Ala Thr Asn Gin lie Arg Lys Lys 
445 450 455 

gaa ccc cag att tct ace aca aca cac tac aat cgc ata ggg acg aaa 1566 
Glu Pro Gin lie Ser Thr Thr Thr His Tyr Asn Arg lie Gly Thr Lys 
460 465 470 

tac aat gaa gec aag act aac cga tec cca aca aga gga agt gaa ttc 1614 
Tyr Asn Glu Ala Lys Thr Asn Arg Ser Pro Thr Arg Gly Ser Glu Phe 
475 480 48 5 

tct gga aag ggt gat gtt ccc aat aca tct tta aat tec act tec caa 1662 
Ser Gly Lys Gly Asp Val Pro Asn Thr Ser Leu Asn Ser Thr Ser Gin 
490 495 500 505 

cca gtc act aaa tta gee aca gaa aaa gat att tec ttg act tct cag 1710 
Pro Val Thr Lys Leu Ala Thr Glu Lys Asp lie Ser Leu Thr Ser Gin 
510 515 520 

act gtg act gaa ctg cca cct cac act gtg gaa ggt act tea gec tct 1758 
Thr Val Thr Glu Leu Pro Pro His Thr Val Glu Gly Thr Ser Ala Ser 
525 530 535 

tta aat gat ggc tct aaa act gtt ctt aga tct cca cat atg aac ttg 1806 
Leu Asn Asp Gly Ser Lys Thr Val Leu Arg Ser Pro His Met Asn Leu 
540 545 550 

tcg ggg act gca gaa tcc tta aat aca gtt tct ata aca gaa tat gag 1854 

Ser Gly Thr Ala Glu Ser Leu Asn Thr Val Ser Ile Thr Glu Tyr Glu 

555 560 565 

gag gag agt tta ttg acc agt ttc aag ctt gat act gga gct gaa gat 1902 
Glu Glu Ser Leu Leu Thr Ser Phe Lys Leu Asp Thr Gly Ala Glu Asp 
570 575 580 585 

tct tea ggc tec agt ccc gca act tct get ate cca ttc ate tct gag 1950 
Ser Ser Gly Ser Ser Pro Ala Thr Ser Ala lie Pro Phe lie Ser Glu 

590 595 600 

aac ata t <t raa aaa tat ata ttt tec tec aaa aac cca aaa aca ata 1998 



tea act tea tea ggt tea gaa gaa tea eta aag gat cct tet atg gag 2094 

Ser Thr Ser Ser Gly Ser Glu Glu Ser Leu Lys Asp Pro Ser Met Glu 
635 640 645 

gga aat gtg tgg ttt cct agc tct aca gac ata aca gca cag ccc gat 2142 

Gly Asn Val Trp Phe Pro Ser Ser Thr Asp Ile Thr Ala Gln Pro Asp 

650 655 660 665 

gtt gga tea ggc aga gag age ttt etc cag act aat tac act gag ata 2 190 

Val Gly Ser Gly Arg Glu Ser Phe Leu Gin Thr Asn Tyr Thr Glu lie 

670 675 680 

cgt gtt gat gaa tet gag aag aca ace aag tec ttt tct gca ggc cea 2238 

Arg Val Asp Glu Ser Glu Lys Thr Thr Lys Ser Phe Ser Ala Gly Pro 

685 690 695 

gtg atg tea cag ggt ccc tea gtt aca gat etg gaa atg cca cat tat 2286 

Val Met Ser Gin Gly Pro Ser Val Thr Asp Leu Glu Met Pro His Tyr 

700 705 710 

tct acc ttt gee tac ttc cca act gag gta aca cct cat get ttt aec 23 34 

Ser Thr Phe Ala Tyr Phe Pro Thr Glu Val Thr Pro His Ala Phe Thr 
715 720 725 

cca tee tec aga eaa cag gat ttg gtc tec acg gte aac gtg gta tac 2382 

Pro Ser Ser Arg Gin Gin Asp Leu Val Ser Thr Val Asn Val Val Tyr 

730 735 740 745 

teg eag aca ace eaa ecg gta tac aat ggt gag aca cct ctt eaa cet 24 3 0 

Ser Gin Thr Thr Gin Pro Val Tyr Asn Gly Glu Thr Pro Leu Gin Pro 

750 755 760 

tee tac agt agt gaa gtc ttt cct eta gte acc cct ttg ttg ctt gac 2478 

Ser Tyr Ser Ser Glu Val Phe Pro Leu Val Thr Pro Leu Leu Leu Asp 

765 770 775 

aat eag ate etc aac act acc cct get get tea agt agt gat teg gec 2526 

Asn Gin lie Leu Asn Thr Thr Pro Ala Ala Ser Ser Ser Asp Ser Ala 

78 0 78 5 7 90 

ttg eat get a:q eet gta ttt: ccc agt: gtc gat gtg tea ttt gaa tee 2 574 

Leu His Ala Thr Pro Val Phe Pro Ser Val Asp Val Ser Phe Glu Ser 
795 800 805 

atc ctg tct tcc tat gat ggt gca cct ttg ctt cca ttt tcc tct gct 2622 

Ile Leu Ser Ser Tyr Asp Gly Ala Pro Leu Leu Pro Phe Ser Ser Ala 

810 815 820 825 

tcc ttc agt agt gaa ttg ttt cgc cat ctg cat aca gtt tct caa atc 2670 

Ser Phe Ser Ser Glu Leu Phe Arg His Leu His Thr Val Ser Gln Ile 



get tct ctg cca gtg get ggg ggt gat ttg eta tta gag ccc age ctt 2766 
Ala Ser Leu Pro Val Ala Gly Gly Asp Leu Leu Leu Glu Pro Ser Leu 
860 865 870 

get eag tat tct gat gtg ctg tec act act cat get get tea gag acg 2814 
Ala Gin Tyr Ser Asp Val Leu Ser Thr Thr His Ala Ala Ser Glu Thr 
8 75 8 30 885 

ctg gaa ttt ggt agt gaa tct ggt gtt ctt tat aaa acg ctt atg ttt 2862 
Leu Glu Phe Gly Ser Glu Ser Gly Val Leu Tyr Lys Thr Leu Met Phe 
890 895 900 905 

tct caa gtt gaa cca ccc age agt gat gec atg atg cat gca cgt tct 2910 
Ser Gin Val Glu Pro Pro Ser Ser Asp Ala Met Met His Ala Arg Ser 
910 915 920 

tea ggg cct gaa cct tct tat gee ttg tct gat aat gag ggc tec caa 2958 
Ser Gly Pro Glu Pro Ser Tyr Ala Leu Ser Asp Asn Glu Gly Ser Gin 
925 930 935 

cac ate ttc act gtt tct tac agt tct gca ata cct gtg cat gat tct 3006 
His lie Phe Thr Val Ser Tyr Ser Ser Ala lie Pro Val His Asp Ser 
940 945 950 

gtg ggt gta act tat cag ggt tec tta ttt age ggc cct age cat ata 3054 
Val Gly Val Thr Tyr Gin Gly Ser Leu Phe Ser Gly Pro Ser His lie 
955 960 965 

cca ata cct aag tct teg tta ata acc cca act gca tea tta ctg cag 3102 
Pro lie Pro Lys Ser Ser Leu lie Thr Pro Thr Ala Ser Leu Leu Gin 
970 975 980 985 

cct act cat gec etc tct ggt gat ggg gaa tgg tct gga gee tct tct 3150 
Pro Thr His Ala Leu Ser Gly Asp Gly Glu Trp Ser Gly Ala Ser Ser 
990 995 1000 

gat agt gaa ttt ctt tta cct gac aca gat ggg ctg aca gee ctt 3195 
Asp Ser Glu Phe Leu Leu Pro Asp Thr Asp Gly Leu Thr Ala Leu 
1005 1010 1015 

aac att tct tca cct gtt tct gta gct gaa ttt aca tat aca aca 3240 
Asn Ile Ser Ser Pro Val Ser Val Ala Glu Phe Thr Tyr Thr Thr 

1020 1025 1030 

tct gtg ttt ggt gat gat aat aag gcg ctt tct aaa agt gaa ata 3285 
Ser Val Phe Gly Asp Asp Asn Lys Ala Leu Ser Lys Ser Glu lie 
1035 1040 1045 

ata tat gga aat gag act gaa ctg caa att cct tct ttc aat gag 3330 
Ile Tyr Gly Asn Glu Thr Glu Leu Gln Ile Pro Ser Phe Asn Glu 

1050 1055 1060 



Asn Val Asn 



Lys 
10 8 0 



Leu Asn Ala 



Ser Leu 
1085 



Gin Glu Thr 



Ser Val Ser 

1090 



att 
lie 



tct age 
Ser Ser 



acc 
Thr 
1095 



aag ggc atg 
Lys Gly Met 



ttt cca 
Phe Pro 
1100 



ggg tec 
Gly Ser 



ctt get eat acc 
Leu Ala His Thr 
1105 



34 65 



acc act aag get ttt gat cat gag att agt caa gtt cca gaa aat 
Thr Thr Lys Val Phe Asp His Glu lie Ser Gin Val Pro Glu Asn 
1110 1115 1120 



3510 



aac ttt tca gtt caa cct aca cat act gtc tct caa gca tct ggt 3555 
Asn Phe Ser Val Gln Pro Thr His Thr Val Ser Gln Ala Ser Gly 
1125 1130 1135 

gac act tcg ctt aaa cct gtg ctt agt gca aac tca gag cca gca 3600 
Asp Thr Ser Leu Lys Pro Val Leu Ser Ala Asn Ser Glu Pro Ala 
1140 1145 1150 

tcc tct gac cct gct tct agt gaa atg tta tct cct tca act cag 3645 
Ser Ser Asp Pro Ala Ser Ser Glu Met Leu Ser Pro Ser Thr Gln 
1155 1160 1165 

ctc tta ttt tat gag acc tca gct tct ttt agt act gaa gta ttg 3690 
Leu Leu Phe Tyr Glu Thr Ser Ala Ser Phe Ser Thr Glu Val Leu 
1170 1175 1180 

cta caa cct tcc ttt cag gct tct gat gtt gac acc ttg ctt aaa 3735 
Leu Gln Pro Ser Phe Gln Ala Ser Asp Val Asp Thr Leu Leu Lys 
1185 1190 1195 

act gtt ctt cca gct gtg ccc agt gat cca ata ttg gtt gaa acc 3780 
Thr Val Leu Pro Ala Val Pro Ser Asp Pro Ile Leu Val Glu Thr 
1200 1205 1210 

ccc aaa gtt gat aaa att agt tct aca atg ttg cat ctc att gta 3825 
Pro Lys Val Asp Lys Ile Ser Ser Thr Met Leu His Leu Ile Val 
1215 1220 1225 



tca aat tct gct tca agt gaa aac atg ctg cac tct aca tct gta 3870 
Ser Asn Ser Ala Ser Ser Glu Asn Met Leu His Ser Thr Ser Val 
1230 1235 1240 

cca gtt ttt gat gtg tcg cct act tct cat atg cac tct gct tca 3915 
Pro Val Phe Asp Val Ser Pro Thr Ser His Met His Ser Ala Ser 
1245 1250 1255 

ctt caa ggt ttg acc att tcc tat gca agt gag aaa tat gaa cca 3960 
Leu Gln Gly Leu Thr Ile Ser Tyr Ala Ser Glu Lys Tyr Glu Pro 
1260 1265 1270 



1290 



1295 



1300 



cag gcc cat ccc cca aaa gga agg cat gta ttt get aca cct gtt 4095 

Gin Ala His Pro Pro Lys Gly Arg His Val Phe Ala Thr Pro Val 

1305 1310 1315 

tta tea att gat gaa cca tta aat aca eta ata aat aag ctt ata 4140 

Leu Ser lie Asp Glu Pro Leu Asn Thr Leu lie Asn Lys Leu lie 

1320 1325 13 3 0 

cat tec gat gaa att tta ace tec aec aaa agt tct gtt act ggt 4185 

His Ser Asp Glu lie Leu Thr Ser Thr Lys Ser Ser Val Thr Gly 

1335 1340 1345 

aag gta ttt get ggt att cca aca gtt get tct gat aca ttt gta 4230 

Lys Val Phe Ala Gly lie Pro Thr Val Ala Ser Asp Thr Phe Val 

1350 1355 1360 

tct act gat cat tct gtt cct ata gga aat ggg cat gtt gcc att 4275 

Ser Thr Asp His Ser Val Pro Ile Gly Asn Gly His Val Ala Ile 

1365 1370 1375 

aca get gtt tct ccc cac aga gat ggt tct gta ace tea aca aag 4320 

Thr Ala Val Ser Pro His Arg Asp Gly Ser Val Thr Ser Thr Lys 

1380 1385 1390 

ttg ctg ttt cet tct aag gca act tct gag etg agt cat agt gcc 4365 

Leu Leu Phe Pro Ser Lys Ala Thr Ser Glu Leu Ser His Ser Ala 

1395 1400 1405 

aaa tct gat gcc ggt tta gtg ggt ggt ggt gaa gat ggt gac act 4410 

Lys Ser Asp Ala Gly Leu Val Gly Gly Gly Glu Asp Gly Asp Thr 

1410 1415 1420 

gat gat gat ggt gat gat gat gat gac aga gat agt gat ggc tta 4455 

Asp Asp Asp Gly Asp Asp Asp Asp Asp Arg Asp Ser Asp Gly Leu 

1425 1430 1435 

tec att cat aag tgt atg tea tgc tea tec tat aga gaa tea cag 4500 

Ser lie His Lys Cys Met Ser Cys Ser Ser Tyr Arg Glu Ser Gin 

1440 1445 1450 

gaa aag gta atg aat gat tca gac acc cac aaa aac agt ctt atg 4545 

Glu Lys Val Met Asn Asp Ser Asp Thr His Glu Asn Ser Leu Met 

1455 1460 1465 

gat cag aat aat cca ate tea tac tea eta tct gag aat tct gaa 4590 

Asp Gin Asn Asn Pro lie Ser Tyr Ser Leu Ser Glu Asn Ser Glu 

1470 1475 1480 

aaa aat aat aga ate aca aat ata tec tea qac aqt caa act ggt 46 3 5 



caa aag cac aat gat gga aaa gag gaa aat gac att cag act ggt 4725 

Gin Lys His Asn Asp Gly Lys Glu Glu Asn Asp He Gin Thr Gly 

1515 1520 1525 

agt get ctg ctt cct etc age cct gaa tct aaa gca tgg gca gtt 4770 

Ser Ala Leu Leu Pro Leu Ser Pro Glu Ser Lys Ala Trp Ala Val 

1530 1535 1540 

ctg aca agt gat gaa gaa agt gga tea ggg caa ggt ace tea gat 4815 

Leu Thr Ser Asp Glu Glu Ser Gly Ser Gly Gin Gly Thr Ser Asp 

1545 1550 1555 

age ctt aat gag aat gag act tec aca gat ttc agt ttt gca gac 4860 

Ser Leu Asn Glu Asn Glu Thr Ser Thr Asp Phe Ser Phe Ala Asp 

1560 1565 1570 

act aat gaa aaa gat get gat ggg ate ctg gca gca ggt gac tea 4 905 

Thr Asn Glu Lys Asp Ala Asp Gly He Leu Ala Ala Gly Asp Ser 

1575 1580 1585 

gaa ata act cct gga ttc cca cag tec cca aca tea tct gtt act 4950 

Glu He Thr Pro Gly Phe Pro Gin Ser Pro Thr Ser Ser Val Thr 

1590 1595 1600 

age gag aac tea gaa gtg ttc cac gtt tea gag gca gag gec agt 4 995 

Ser Glu Asn Ser Glu Val Phe His Val Ser Glu Ala Glu Ala Ser 

1605 1610 1615 

aat agt age cat gag tct cgt att ggt eta get gag ggg ttg gaa 504 0 

Asn Ser Ser His Glu Ser Arg He Gly Leu Ala Glu Gly Leu Glu 

1620 1625 1630 

tec gag aag aag gca gtt ata ece ctt gtg ate gtg tea gec ctg 5085 

Ser Glu Lys Lys Ala Val He Pro Leu Val He Val Ser Ala Leu 

1635 1640 1645 

act ttt ate tgt eta gtg gtt ctt gtg ggt att etc ate tac tgg 5130 

Thr Phe Ile Cys Leu Val Val Leu Val Gly Ile Leu Ile Tyr Trp 

1650 1655 1660 

agg aaa tgc ttc cag act gca car ttt tac tta gag gar agt aca 5175 

Arg Lys Cys Phe Gln Thr Ala His Phe Tyr Leu Glu Asp Ser Thr 

1665 1670 1675 

tcc cct aga gtt ata tcc aca cct cca aca cct atc ttt cca att 5220 

Ser Pro Arg Val Ile Ser Thr Pro Pro Thr Pro Ile Phe Pro Ile 

1680 1685 1690 

tca gat gat gtc gga gca att cca ata aag cac ttt cca aag cat 5265 

Ser Asp Asp Val Gly Ala Ile Pro Ile Lys His Phe Pro Lys His 



aca ctg aaa gag ttt tac cag gaa gtg cag age tgt act gtt gac 5355 

Thr Leu Lys Glu Phe Tyr Gin Glu Val Gin Ser Cys Thr Val Asp 

1725 1730 1735 

tta ggt att aca gca gac age tec aac cac cca gac aac aag cac 5400 

Leu Gly lie Thr Ala Asp Ser Ser Asn His Pro Asp Asn Lys His 

1740 1745 1750 

aag aat cga tac ata aat ate gtt gee tat gat cat age agg gtt 5445 

Lys Asn Arg Tyr lie Asn lie Val Ala Tyr Asp His Ser Arg Val 

1755 1760 1765 

aag eta gca cag ctt get gaa aag gat ggc aaa ctg act gat tat 5490 

Lys Leu Ala Gin Leu Ala Glu Lys Asp Gly Lys Leu Thr Asp Tyr 

1770 1775 1780 

ate aat gee aat tat gtt gat ggc tac aac aga cca aaa get tat 5535 

lie Asn Ala Asn Tyr Val Asp Gly Tyr Asn Arg Pro Lys Ala Tyr 

1785 1790 1795 

att get gee caa ggc cca ctg aaa tec aca get gaa gat ttc tgg 5580 

lie Ala Ala Gin Gly Pro Leu Lys Ser Thr Ala Glu Asp Phe Trp 

1800 1805 1810 

aga atg ata tgg gaa cat aat gtg gaa gtt att gtc atg ata aca 5625 

Arg Met He Trp Glu His Asn Val Glu Val He Val Met He Thr 

1815 1820 1825 

aac etc gtg gag aaa gga agg aga aaa tgt gat cag tac tgg cct 5670 

Asn Leu Val Glu Lys Gly Arg Arg Lys Cys Asp Gin Tyr Trp Pro 

1830 1835 1840 

gee gat ggg agt gag gag tac ggg aac ttt ctg gtc act cag aag 5715 

Ala Asp Gly Ser Glu Glu Tyr Gly Asn Phe Leu Val Thr Gin Lys 

1845 1850 1855 

agt gtg caa gtg ctt gee tat tat act gtg agg aat ttt act eta 5760 

Ser Val Gin Val Leu Ala Tyr Tyr Thr Val Arg Asn Phe Thr Leu 

1860 1865 1870 

aga aac aca aaa ata aaa aag jgc tec cag aaa gga aga cc : agt 5805 

Arg Asn Thr Lys Ile Lys Lys Gly Ser Gln Lys Gly Arg Pro Ser 

1875 1880 1885 

gga cgt gtg gtc aca cag tat cac tac acg cag tgg cct gac atg 5850 

Gly Arg Val Val Thr Gln Tyr His Tyr Thr Gln Trp Pro Asp Met 

1890 1895 1900 

gga gta cca gag tac tcc ctg cca gtg ctg acc ttt gtg aga aag 5895 

Gly Val Pro Glu Tyr Ser Leu Pro Val Leu Thr Phe Val Arg Lys 

1905 1910 1915 



Cys Ser Ala Gly Val Gly Arg Thr Gly Thr Tyr Ile Val Leu Asp 

1935 1940 1945 

agt atg ttg cag cag att caa cac gaa gga act gtc aac ata ttt 6030 

Ser Met Leu Gln Gln Ile Gln His Glu Gly Thr Val Asn Ile Phe 

1950 1955 1960 



ggc ttc tta aaa cac atc cgt tca caa aga aat tat ttg gta caa 6075 

Gly Phe Leu Lys His Ile Arg Ser Gln Arg Asn Tyr Leu Val Gln 

1965 1970 1975 



act gag gag caa tat gtc ttc att cat gat aca ctg gtt gag gcc 6120 

Thr Glu Glu Gln Tyr Val Phe Ile His Asp Thr Leu Val Glu Ala 

1980 1985 1990 



ata ctt agt aaa gaa act gag gtg ctg gac agt cat att cat gcc 6165 

Ile Leu Ser Lys Glu Thr Glu Val Leu Asp Ser His Ile His Ala 

1995 2000 2005 



tat gtt aat gca ctc ctc att cct gga cca gca ggc aaa aca aag 6210 

Tyr Val Asn Ala Leu Leu Ile Pro Gly Pro Ala Gly Lys Thr Lys 

2010 2015 2020 



cta gag aaa caa ttc cag ctc ctg agc cag tca aat ata cag cag 6255 

Leu Glu Lys Gln Phe Gln Leu Leu Ser Gln Ser Asn Ile Gln Gln 

2025 2030 2035 



agt gac tat tct gca gcc cta aag caa tgc aac agg gaa aag aat 6300 

Ser Asp Tyr Ser Ala Ala Leu Lys Gln Cys Asn Arg Glu Lys Asn 

2040 2045 2050 



cga act tct tct atc atc cct gtg gaa aga tca agg gtt ggc att 6345 

Arg Thr Ser Ser Ile Ile Pro Val Glu Arg Ser Arg Val Gly Ile 

2055 2060 2065 



tca tcc ctg agt gga gaa ggc aca gac tac atc aat gcc tcc tat 6390 

Ser Ser Leu Ser Gly Glu Gly Thr Asp Tyr Ile Asn Ala Ser Tyr 

2070 2075 2080 



atc atg ggc tat tac cag agc aat gaa ttc atc att acc caa cac 6435 

Ile Met Gly Tyr Tyr Gln Ser Asn Glu Phe Ile Ile Thr Gln His 

2085 2090 2095 



cct ctc ctt cat acc atc aag gat ttc tgg agg atg ata tgg gac 6480 

Pro Leu Leu His Thr Ile Lys Asp Phe Trp Arg Met Ile Trp Asp 

2100 2105 2110 



cat aat gcc caa ctg gtg gtt atg att cct gat ggc caa aac atg 6525 

His Asn Ala Gln Leu Val Val Met Ile Pro Asp Gly Gln Asn Met 

2115 2120 2125 



2145 2150 2155 

tgt cta tct aat gag gaa aaa ctt ata att cag gac ttt atc tta 6660 

Cys Leu Ser Asn Glu Glu Lys Leu Ile Ile Gln Asp Phe Ile Leu 

2160 2165 2170 

gaa get aca cag gat gat tat gta ctt gaa gtg agg cac ttt cag 6705 

Glu Ala Thr Gin Asp Asp Tyr Val Leu Glu Val Arg His Phe Gin 

2175 2180 2185 

tgt cct aaa tgg cca aat cca gat age ccc att agt aaa act ttt 6750 

Cys Pro Lys Trp Pro Asn Pro Asp Ser Pro He Ser Lys Thr Phe 

2190 2195 2200 

gaa ctt ata agt gtt ata aaa gaa gaa get gee aat agg gat ggg 6795 

Glu Leu He Ser Val He Lys Glu Glu Ala Ala Asn Arg Asp Gly 

2205 2 2 10 2215 

cct atg att gtt cat gat gag cat gga gga gtg acg gca gga act 6840 

Pro Met He Val His Asp Glu His Gly Gly Val Thr Ala Gly Thr 

2220 2225 2230 

ttc tgt get ctg aca ace ctt atg cac caa eta gaa aaa gaa aat 6885 

Phe Cys Ala Leu Thr Thr Leu Met His Gin Leu Glu Lys Glu Asn 

2 2 35 2240 2245 

tec gtg gat gtt tac cag gta gec aag atg ate aat ctg atg agg 6930 

Ser Val Asp Val Tyr Gin Val Ala Lys Met He Asn Leu Met Arg 

2250 2255 2260 

cca gga gtc ttt get gac att gag cag tat cag ttt etc tac aaa 6975 

Pro Gly Val Phe Ala Asp He Glu Gin Tyr Gin Phe Leu Tyr Lys 

2265 2270 2275 

gtg ate etc age ctt gtg age aca agg cag gaa gag aat cca tec 7020 

Val He Leu Ser Leu Val Ser Thr Arg Gin Glu Glu Asn Pro Ser 

2280 2285 2290 

ace tct ctg gac agt aat ggt gca gca ttg cct gat gga aat ata 7065 

Thr Ser Leu Asp Ser Asn Gly Ala Ala Leu Pro Asp Gly Asn lie 

2295 2300 2305 

art. gaa age tta gag tct tta att taa :'icaaaaaqa aataggaaga " : : 2 

Ala Glu Ser Leu Glu Ser Leu Val 
2310 

ctcacatctg agcattgttt tcctcttcct aaaattaggc aggaaaatca gtctagttct 7i^ T 2 

gttatctgtt gatttcccat cacctgacag taactttcat gaeataggat tctgecgeca 7232 



aattratate a 1 1 a a c a a t a 



rotacctttt vacaaaactt ataatttact tat.tatat.tt 



7 2 92 



tataactttt aatacagtag cctgtaaata aaacactctt ccatatgata ttcaacattt 7532 

tacaactgca gtattcacct aaagtagaaa taatctgtta cttattgtaa atactgccct 7592 

agtgtctcca tggaccaaat ttatatttat aattgtagat ttttatattt tactactgag 7652 

tcaagttttc tagttctgtg taattgttta gtttaatgac gtagttcatt agctggtctt 7712 

actctaccag ttttctgaca ttgtattgtg ttacctaagt cattaacttt gtttcagcat 7772 

gtaattttaa cttttgtgga aaatagaaat accttcattt tgaaagaagt ttttatgaga 7832 

ataacacctt accaaacatt gttcaaatgg tttttatcca aggaattgca aaaataaata 7892 

taaatattgc cattaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 7941 



<211> 2314 
<212> PRT 



<213> Homo sapiens 
<220> 
<221> gene 
<222> (1) . . (2314) 
<223> PTP-zeta 

<220> 
<221> SIGNAL 
<222> (1) . . (24) 
<223> By similarity 

<220> 
<221> DOMAIN 
<222> (25) . . (1635) 
<223> Extracellular (potential) 

<220> 
<221> MISC_FEATURE 
<222> (25) . . (2314) 
<223> Mature chain; protein-tyrosine phosphatase zeta 

<220> 
<221> DOMAIN 
<222> (34) . . (302) 
<223> Carbonic-anhydrase like 



<220> 
<221> SITE 
<222> (105) . . (105) 

<220> 
<221> SITE 
<222> (223) . . (223) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> SITE 
<222> (232) . . (232) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> SITE 
<222> (324) . . (324) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<221> DOMAIN 
<222> (312) . . (406) 
<223> Fibronectin type-III 



<220> 
<221> SITE 
<222> (381) . . (381) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> SITE 
<222> (497) . . (497) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> SITE 
<222> (501) . . (501) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> SITE 
<222> (552) . . (552) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<220> 
<221> BINDING 
<222> (567) . . (567) 
<223> Chondroitin sulfate (potential) 

<220> 
<221> SITE 
<222> (602) . . (602) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 
<221> BINDING 
<222> (637) . . (637) 
<223> Chondroitin sulfate (potential) 

<220> 
<221> SITE 
<222> (677) . . (677) 
<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<221> VARSPLIC 

<222> (755) . . (1614) 

<223> Splicing variant; missing (in short isoform) 



<221> BINDING 

<222> (997) . . (997) 

<223> Chondroitin sulfate (potential) 
<220> 

<221> SITE 

<222> (1017) . . (1017) 

<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<221> SITE 

<222> (1050) . . (1050) 

<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<221> SITE 

<222> (1082) . . (1082) 

<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<221> SITE 

<222> (1122) . . (1122) 

<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<220> 




. ( 14 5(i ) 



lat. i on ; 



potential 



<221> BINDING 

<222> (1548) . . (1548) 

<223> Chondroitin sulfate (potential) 



<221> BINDING 



<223> Glycosylation; N-linked (GLCNAC...) (potential) 

<220> 

<221> SITE 

<222> (1617) . . (1617) 

<223> Glycosylation; N-linked (GLCNAC...) (potential) 



<220> 
<221> TRANSMEM 
<222> (1636) . . (1661) 
<223> Transmembrane region; potential 

<220> 
<221> DOMAIN 
<222> (1662) . . (2314) 
<223> Cytoplasmic (potential) 

<220> 
<221> CONFLICT 
<222> (1722) . . (1728) 
<223> Missing (in ref. 2) 

<220> 
<221> DOMAIN 
<222> (1744) . . (1997) 
<223> Protein-tyrosine phosphatase 

<220> 
<221> ACT_SITE 
<222> (1932) . . (1932) 
<223> Active site; by similarity 

<220> 
<221> DOMAIN 
<222> (1998) . . (2314) 
<223> Protein-tyrosine phosphatase 

<220> 
<221> MISC_FEATURE 
<222> (2222) . . (2222) 
<223> Ancestral active site 



<300> 
<301> Krueger, N.X. and Saito, H. 
<302> A human transmembrane protein-tyrosine-phosphatase, PTP zeta, is 
expressed in brain and has an N-terminal receptor domain homologous to 
carbonic anhydrases 
<303> Proc. Natl. Acad. Sci. USA 
<304> 89 
<305> 16 

<300> 
<301> Levy, J.B., et al. 
<302> The cloning of a receptor-type protein tyrosine phosphatase expressed 
in the central nervous system 
<303> J. Biol. Chem. 
<304> 268 
<305> 14 
<306> 10573-10581 
<307> 1993 
<309> 
<313> (1) . . (2314) 



<400> 6 



Met Arg Ile Leu Lys Arg Phe Leu Ala Cys Ile Gln Leu Leu Cys Val 
1 5 10 15 



Cys Arg Leu Asp Trp Ala Asn Gly Tyr Tyr Arg Gln Gln Arg Lys Leu 
20 25 30 



Val Glu Glu lie Gly Trp Ser Tyr Thr Gly Ala Leu Asn Gin Lys Asn 
35 40 45 



Trp Gly Lys Lys Tyr Pro Thr Cys Asn Ser Pro Lys Gin Ser Pro lie 
50 55 60 



Asn lie Asp Glu Asp Leu Thr Gin Val Asn Val Asn Leu Lys Lys Leu 
65 70 75 80 



Lys Phe Gln Gly Trp Asp Lys Thr Ser Leu Glu Asn Thr Phe Ile His 
85 90 95 



Asn Thr Gly Lys Thr Val Glu Ile Asn Leu Thr Asn Asp Tyr Arg Val 

100 105 110 



Ser Gly Gly Val Ser Glu Met Val Phe Lys Ala Ser Lys Ile Thr Phe 

115 120 125 



His Trp Gly Lys Cys Asn Met Ser Ser Asp Gly Ser Glu His Ser Leu 

130 135 140 



Arg Ala Leu Ser He Leu Phe Glu Val Gly Thr Glu Glu Asn Leu Asp 

180 185 190 



Phe Lys Ala He He Asp Gly Val Glu Ser Val Ser Arg Phe Gly Lys 
195 200 205 



Gin Ala Ala Leu Asp Pro Phe He Leu Leu Asn Leu Leu Pro Asn Ser 
210 215 220 



Thr Asp Lys Tyr Tyr lie Tyr Asn Gly Ser Leu Thr Ser Pro Pro Cys 
225 230 235 240 



Thr Asp Thr Val Asp Trp Ile Val Phe Lys Asp Thr Val Ser Ile Ser 
245 250 255 



Glu Ser Gin Leu Ala Val Phe Cys Glu Val Leu Thr Met Gin Gin Ser 
260 265 270 



Gly Tyr Val Met Leu Met Asp Tyr Leu Gin Asn Asn Phe Arg Glu Gin 
275 280 285 



Gin Tyr Lys Phe Ser Arg Gin Val Phe Ser Ser Tyr Thr Gly Lys Glu 
290 295 300 



Glu He His Glu Ala Val Cys Ser Ser Glu Pro Glu Asn Val Gin Ala 
305 310 315 320 



Asp Pro Glu Asn Tyr Thr Ser Leu Leu Val Thr Trp Glu Arg Pro Arg 

325 330 335 



Val Val Tyr Asp Thr Met Ile Glu Lys Phe Ala Val Leu Tyr Gln Gln 
340 345 350 



Leu Asp Gly Glu Asp Gin Thr Lys His Glu Phe Leu Thr Asp Gly Tyr 
355 360 365 



Gln Asp Leu Gly Ala Ile Leu Asn Asn Leu Leu Pro Asn Met Ser Tyr 



Ser Asp Gin Leu lie Val Asp Met Pro Thr Asp Asn Pro Glu Leu Asp 
405 410 415 



Leu Phe Pro Glu Leu lie Gly Thr Glu Glu lie lie Lys Glu Glu Glu 
420 425 430 



Glu Gly Lys Asp lie Glu Glu Gly Ala lie Val Asn Pro Gly Arg Asp 
435 440 445 



Ser Ala Thr Asn Gin lie Arg Lys Lys Glu Pro Gin lie Ser Thr Thr 
450 455 460 



Thr His Tyr Asn Arg lie Gly Thr Lys Tyr Asn Glu Ala Lys Thr Asn 
465 470 475 480 



Arg Ser Pro Thr Arg Gly Ser Glu Phe Ser Gly Lys Gly Asp Val Pro 
485 490 495 



Asn Thr Ser Leu Asn Ser Thr Ser Gin Pro Val Thr Lys Leu Ala Thr 
500 505 510 



Glu Lys Asp lie Ser Leu Thr Ser Gin Thr Val Thr Glu Leu Pro Pro 
515 520 525 



His Thr Val Glu Gly Thr Ser Ala Ser Leu Asn Asp Gly Ser Lys Thr 

530 535 540 



Val Leu Arg Ser Pro His Met Asn Leu Ser Gly Thr Ala Glu Ser Leu 
545 550 555 560 



Asn Thr Val Ser lie Thr Glu Tyr Glu Glu Glu Ser Leu Leu Thr Ser 



Phe Lys Leu Asp Thr Gly Ala Glu Asp Ser Ser Gly Ser Ser Pro Ala 
5 80 585 590 



Thr Ser Ala Ile Pro Phe Ile Ser Glu Asn Ile Ser Gln Gly Tyr Ile 
595 600 605 



Glu Ser Ala Arg Asn Ala Ser Glu Asp Ser Thr Ser Ser Gly Ser Glu 
625 630 635 640 



Glu Ser Leu Lys Asp Pro Ser Met Glu Gly Asn Val Trp Phe Pro Ser 
645 650 655 



Ser Thr Asp lie Thr Ala Gin Pro Asp Val Gly Ser Gly Arg Glu Ser 
660 665 670 



Phe Leu Gln Thr Asn Tyr Thr Glu Ile Arg Val Asp Glu Ser Glu Lys 
675 680 685 



Thr Thr Lys Ser Phe Ser Ala Gly Pro Val Met Ser Gin Gly Pro Ser 
690 695 700 



Val Thr Asp Leu Glu Met Pro His Tyr Ser Thr Phe Ala Tyr Phe Pro 
705 710 715 72 0 



Thr Glu Val Thr Pro His Ala Phe Thr Pro Ser Ser Arg Gin Gin Asp 

725 7 30 735 



Leu Val Ser Thr Val Asn Val Val Tyr Ser Gin Thr Thr Gin Pro Val 
740 745 750 



Tyr Asn Gly Glu Thr Pro Leu Gin Pro Ser Tyr Ser Ser Glu Val Phe 
755 760 765 



Pro Leu Val Thr Pro Leu Leu Leu Asp Asn Gin lie Leu Asn Thr Thr 
770 775 780 



Pro Ala Ala Ser Ser Ser Asp Ser Ala Leu His Ala Thr Pro Val Phe 
78 5 7 90 7 95 8 00 



Pro Ser Val Asp Val Ser Phe Glu Ser lie Leu Ser Ser Tyr Asp Gly 
805 810 815 



Ala Pro Leu Leu Pro Phe Ser Ser Ala Ser Phe Ser Ser Glu Leu Phe 

820 825 830 



850 855 860 



Gly Asp Leu Leu Leu Glu Pro Ser Leu Ala Gin Tyr Ser Asp Val Leu 
865 870 875 880 



Ser Thr Thr His Ala Ala Ser Glu Thr Leu Glu Phe Gly Ser Glu Ser 
885 890 895 



Gly Val Leu Tyr Lys Thr Leu Met Phe Ser Gin Val Glu Pro Pro Ser 
900 905 910 



Ser Asp Ala Met Met His Ala Arg Ser Ser Gly Pro Glu Pro Ser Tyr 
915 920 925 



Ala Leu Ser Asp Asn Glu Gly Ser Gin His lie Phe Thr Val Ser Tyr 
930 935 940 



Ser Ser Ala lie Pro Val His Asp Ser Val Gly Val Thr Tyr Gin Gly 
945 950 955 960 



Ser Leu Phe Ser Gly Pro Ser His lie Pro lie Pro Lys Ser Ser Leu 
965 970 975 



lie Thr Pro Thr Ala Ser Leu Leu Gin Pro Thr His Ala Leu Ser Gly 
980 985 990 



Asp Gly Glu Trp Ser Gly Ala Ser Ser Asp Ser Glu Phe Leu Leu Pro 
995 1000 1005 



Asp Thr Asp Gly Leu Thr Ala Leu Asn lie Ser Ser Pro Val Ser 

1010 1015 102 0 



Val Ala Glu Phe Thr Tyr Thr Thr Ser Val Phe Gly Asp Asp Asn 
10 2 5 10 3 0 10 3 5 



Lys Ala Leu Ser Lys Ser Glu lie lie Tyr Gly Asn Glu Thr Glu 
104 0 104 5 10 50 



Ser Leu Gin Glu Thr Ser Val Ser He Ser Ser Thr Lys Gly Met 
1085 1090 10 95 



Phe Pro Gly Ser Leu Ala His Thr Thr Thr Lys Val Phe Asp His 
1100 1105 1110 



Glu Ile Ser Gln Val Pro Glu Asn Asn Phe Ser Val Gln Pro Thr 
1115 1120 1125 



His Thr Val Ser Gin Ala Ser Gly Asp Thr Ser Leu Lys Pro Val 
1130 1135 1140 



Leu Ser Ala Asn Ser Glu Pro Ala Ser Ser Asp Pro Ala Ser Ser 
1145 1150 1155 



Glu Met Leu Ser Pro Ser Thr Gin Leu Leu Phe Tyr Glu Thr Ser 
1160 1165 1170 



Ala Ser Phe Ser Thr Glu Val Leu Leu Gin Pro Ser Phe Gin Ala 
1175 1180 1185 



Ser Asp Val Asp Thr Leu Leu Lys Thr Val Leu Pro Ala Val Pro 
1190 1195 1200 



Ser Asp Pro He Leu Val Glu Thr Pro Lys Val Asp Lys He Ser 
1205 1210 1215 



Ser Thr Met Leu His Leu He Val Ser Asn Ser Ala Ser Ser Glu 

1220 1225 1230 



Asn Met Leu His Ser Thr Ser Val Pro Val Phe Asp Val Ser Pro 
1235 1240 1245 



Thr Ser His Met His Ser Ala Ser Leu Gin Gly Leu Thr He Ser 
1250 1255 1260 



Tyr Ala Ser Glu Lys Tyr Glu Pro Val Leu Leu Lys Ser Glu Ser 



Gin Thr Ala Asn Leu Glu lie Asn Gin Ala His Pro Pro Lys Gly 
1295 1300 1305 



Arg His Val Phe Ala Thr Pro Val Leu Ser lie Asp Glu Pro Leu 
1310 1315 1320 



Asn Thr Leu lie Asn Lys Leu lie His Ser Asp Glu lie Leu Thr 
1325 1330 13 3 5 



Ser Thr Lys Ser Ser Val Thr Gly Lys Val Phe Ala Gly lie Pro 
1340 1345 1350 



Thr Val Ala Ser Asp Thr Phe Val Ser Thr Asp His Ser Val Pro 
1355 1360 1365 



lie Gly Asn Gly His Val Ala lie Thr Ala Val Ser Pro His Arg 
1370 1375 1380 



Asp Gly Ser Val Thr Ser Thr Lys Leu Leu Phe Pro Ser Lys Ala 
1385 1390 1395 



Thr Ser Glu Leu Ser His Ser Ala Lys Ser Asp Ala Gly Leu Val 
1400 1405 1410 



Gly Gly Gly Glu Asp Gly Asp Thr Asp Asp Asp Gly Asp Asp Asp 
1415 1420 1425 



Asp Asp Arg Asp Ser Asp Gly Leu Ser lie His Lys Cys Met Ser 
1430 1435 1440 



Cys Ser Ser Tyr Arg Glu Ser Gln Glu Lys Val Met Asn Asp Ser 
1445 1450 1455 



Asp Thr His Glu Asn Ser Leu Met Asp Gln Asn Asn Pro Ile Ser 
1460 1465 1470 



Tyr Ser Leu Ser Glu Asn Ser Glu Glu Asp Asn Arg Val Thr Ser 
1475 1480 1485 



Ser Pro Ser Ala Asn Gly Leu Ser Gin Lys His Asn Asp Gly Lys 
1505 1510 1515 



Glu Glu Asn Asp lie Gin Thr Gly Ser Ala Leu Leu Pro Leu Ser 
1520 1525 1530 



Pro Glu Ser Lys Ala Trp Ala Val Leu Thr Ser Asp Glu Glu Ser 
1535 1540 1545 



Gly Ser Gly Gin Gly Thr Ser Asp Ser Leu Asn Glu Asn Glu Thr 
1550 1555 1560 



Ser Thr Asp Phe Ser Phe Ala Asp Thr Asn Glu Lys Asp Ala Asp 
1565 1570 1575 



Gly lie Leu Ala Ala Gly Asp Ser Glu lie Thr Pro Gly Phe Pro 
1580 1585 1590 



Gin Ser Pro Thr Ser Ser Val Thr Ser Glu Asn Ser Glu Val Phe 
1595 1600 1605 



His Val Ser Glu Ala Glu Ala Ser Asn Ser Ser His Glu Ser Arg 
1610 1615 1620 



He Gly Leu Ala Glu Gly Leu Glu Ser Glu Lys Lys Ala Val He 
1625 1630 1635 



Pro Leu Val He Val Ser Ala Leu Thr Phe He Cys Leu Val Val 
1640 1645 1650 



Leu Val Gly Ile Leu Ile Tyr Trp Arg Lys Cys Phe Gln Thr Ala 
1655 1660 1665 



His Phe Tyr Leu Glu Asp Ser Thr Ser Pro Arg Val Ile Ser Thr 
1670 1675 1680 



Pro Pro Thr Pro Ile Phe Pro Ile Ser Asp Asp Val Gly Ala Ile 

1685 1690 1695 



1715 1720 1725 



Glu Val Gln Ser Cys Thr Val Asp Leu Gly Ile Thr Ala Asp Ser 
1730 1735 1740 



Ser Asn His Pro Asp Asn Lys His Lys Asn Arg Tyr Ile Asn Ile 
1745 1750 1755 



Val Ala Tyr Asp His Ser Arg Val Lys Leu Ala Gin Leu Ala Glu 
1760 1765 1770 



Lys Asp Gly Lys Leu Thr Asp Tyr lie Asn Ala Asn Tyr Val Asp 
1775 1780 1785 



Gly Tyr Asn Arg Pro Lys Ala Tyr lie Ala Ala Gin Gly Pro Leu 
1790 1795 1800 



Lys Ser Thr Ala Glu Asp Phe Trp Arg Met lie Trp Glu His Asn 
1805 1810 1815 



Val Glu Val lie Val Met lie Thr Asn Leu Val Glu Lys Gly Arg 
1820 1825 1830 



Arg Lys Cys Asp Gin Tyr Trp Pro Ala Asp Gly Ser Glu Glu Tyr 
1835 1840 1845 



Gly Asn Phe Leu Val Thr Gin Lys Ser Val Gin Val Leu Ala Tyr 
1850 1855 1860 



Tyr Thr Val Arg Asn Phe Thr Leu Arg Asn Thr Lys Ile Lys Lys 

1865 1870 1875 



Gly Ser Gln Lys Gly Arg Pro Ser Gly Arg Val Val Thr Gln Tyr 
1880 1885 1890 



His Tyr Thr Gln Trp Pro Asp Met Gly Val Pro Glu Tyr Ser Leu 

1895 1900 1905 



Thr Gly Thr Tyr lie Val Leu Asp Ser Met Leu Gin Gin lie Gin 
1940 1945 1950 



His Glu Gly Thr Val Asn lie Phe Gly Phe Leu Lys His lie Arg 
1955 1960 1965 



Ser Gin Arg Asn Tyr Leu Val Gin Thr Glu Glu Gin Tyr Val Phe 
1970 1975 1980 



lie His Asp Thr Leu Val Glu Ala lie Leu Ser Lys Glu Thr Glu 
1985 1990 19 9 5 



Val Leu Asp Ser His lie His Ala Tyr Val Asn Ala Leu Leu lie 
2 00 0 2 00 5 2 010 



Pro Gly Pro Ala Gly Lys Thr Lys Leu Glu Lys Gin Phe Gin Leu 
2015 2020 2025 



Leu Ser Gin Ser Asn lie Gin Gin Ser Asp Tyr Ser Ala Ala Leu 
2 03 0 2 03 5 2 04 0 



Lys Gin Cys Asn Arg Glu Lys Asn Arg Thr Ser Ser lie lie Pro 
2045 2050 2055 



Val Glu Arg Ser Arg Val Gly lie Ser Ser Leu Ser Gly Glu Gly 
2060 2065 2070 



Thr Asp Tyr lie Asn Ala Ser Tyr lie Met Gly Tyr Tyr Gin Ser 

2 0 7 5 2 0 8 0 2 0 8 5 



Asn Glu Phe Ile Ile Thr Gln His Pro Leu Leu His Thr Ile Lys 
2090 2095 2100 



Asp Phe Trp Arg Met lie Trp Asp His Asn Ala Gin Leu Val Val 
2105 2110 2115 



Met Ile Pro Asp Gly Gln Asn Met Ala Glu Asp Glu Phe Val Tyr 



Thr Leu Met Ala Glu Glu His Lys Cys Leu Ser Asn Glu Glu Lys 
2150 2155 2160 



Leu Ile Ile Gln Asp Phe Ile Leu Glu Ala Thr Gln Asp Asp Tyr 
2165 2170 2175 



Val Leu Glu Val Arg His Phe Gin Cys Pro Lys Trp Pro Asn Pro 
2180 2185 2190 



Asp Ser Pro Ile Ser Lys Thr Phe Glu Leu Ile Ser Val Ile Lys
2195 2200 2205



Glu Glu Ala Ala Asn Arg Asp Gly Pro Met Ile Val His Asp Glu
2210 2215 2220



His Gly Gly Val Thr Ala Gly Thr Phe Cys Ala Leu Thr Thr Leu 
2225 2230 2235 



Met His Gin Leu Glu Lys Glu Asn Ser Val Asp Val Tyr Gin Val 
2240 2245 2250 



Ala Lys Met lie Asn Leu Met Arg Pro Gly Val Phe Ala Asp lie 
2255 2260 2265 



Glu Gln Tyr Gln Phe Leu Tyr Lys Val Ile Leu Ser Leu Val Ser
2270 2275 2280



Thr Arg Gin Glu Glu Asn Pro Ser Thr Ser Leu Asp Ser Asn Gly 
2285 2290 2295 



Ala Ala Leu Pro Asp Gly Asn lie Ala Glu Ser Leu Glu Ser Leu 



Val
<223> Angiopoietin-like 2 (ANGPTL2), mRNA

<220> 

<221> CDS

<222> (22) . . (1503) 

<223>

<300>

<301> Kim, I., Moon, S.O., Koh, K.N., Kim, H., Uhm, C.S., Kwak, H.J., Kim,
N.G. and Koh, G.Y.

<302> Molecular cloning, expression, and characterization of angiopoietin-related
protein, angiopoietin-related protein induces endothelial cell sprouting

<303> J. Biol. Chem. 
<304> 274
<305> 37 

<306> 26523-26528 
<307> 1999 
<309> 

<313> (1) . . (1518) 



<400> 7 

aaccaccatt ttgcaaggac c atg agg cca ctg tgc gtg aca tgc tgg tgg 51 

Met Arg Pro Leu Cys Val Thr Cys Trp Trp 

1 5 10

etc gga ctg ctg get gec atg gga get gtt gca ggc cag gag gac ggt 99 
Leu Gly Leu Leu Ala Ala Met Gly Ala Val Ala Gly Gin Glu Asp Gly 
15 20 25 

ttt gag ggc act gag gag ggc teg cca aga gag ttc att tac eta aac 147 
Phe Glu Gly Thr Glu Glu Gly Ser Pro Arg Glu Phe lie Tyr Leu Asn 
30 35 40 

agg tac aag egg gcg ggc gag tec cag gac aag tgc acc tac acc ttc 195 
Arg Tyr Lys Arg Ala Gly Glu Ser Gin Asp Lys Cys Thr Tyr Thr Phe 
45 50 55 

att ctg ccc cag cag cgg gtc acg ggt gcc atc tgc gtc aac tcc aag 243
Ile Val Pro Gln Gln Arg Val Thr Gly Ala Ile Cys Val Asn Ser Lys
60 65 70

gag cct gag gtg ctt ctg gag aac ^ga gtg cat aag cag gag cta gag 291
Glu Pro Glu Val Leu Leu Glu Asn Arg Val His Lys Gln Glu Leu Glu
75 80 85 90

ctg ctc aac aat gag ctg ctc aag cag aag cgg cag atc gag aca ctg 339
Leu Leu Asn Asn Glu Leu Leu Lys Gln Lys Arg Gln Ile Glu Thr Leu
95 100 105



125 130 135 

atg cag etc ctg cac gag ate ate egc aag egg gac aac gcg ttg gag 483 

Met Gin Leu Leu His Glu lie lie Arg Lys Arg Asp Asn Ala Leu Glu 

140 145 150 

etc tec cag ctg gag aac agg ate ctg aac cag aca gec gac atg ctg 531 

Leu Ser Gin Leu Glu Asn Arg lie Leu Asn Gin Thr Ala Asp Met Leu 

155 160 165 170 

cag ctg gcc agc aag tac aag gac ctg gag cac aag tac cag cac ctg 579

Gin Leu Ala Ser Lys Tyr Lys Asp Leu Glu His Lys Tyr Gin His Leu 

175 180 185 

gee aca ctg gee cac aac caa tea gag ate ate gcg cag ctt gag gag 627 

Ala Thr Leu Ala His Asn Gin Ser Glu lie lie Ala Gin Leu Glu Glu 

190 195 200 

cac tgc cag agg gtg ccc teg gec agg ccc gtc ccc cag eca ecc ccc 675 

His Cys Gin Arg Val Pro Ser Ala Arg Pro Val Pro Gin Pro Pro Pro 

205 210 215 

get gec ccg ccc egg gtc tac caa cca ccc ace tac aac cgc ate ate 723 

Ala Ala Pro Pro Arg Val Tyr Gin Pro Pro Thr Tyr Asn Arg lie lie 

220 225 230 

aac cag ate tct ace aac gag ate cag agt gac cag aac ctg aag gtg 771 

Asn Gin lie Ser Thr Asn Glu lie Gin Ser Asp Gin Asn Leu Lys Val 

235 240 245 250

ctg cca ccc cct ctg ccc act atg ecc act etc ace age etc cca tct 819 

Leu Pro Pro Pro Leu Pro Thr Met Pro Thr Leu Thr Ser Leu Pro Ser 

255 260 265 

tec ace gac aag ccg teg ggc cca tgg aga gac tgc ctg cag gec ctg 867 

Ser Thr Asp Lys Pro Ser Gly Pro Trp Arg Asp Cys Leu Gin Ala Leu 

270 275 280 

gag gat ggc cac gac ace age tec ate tac ctg gtg aag ccg gag aac 915 

Glu Asp Gly His Asp Thr Ser Ser lie Tyr Leu Val Lys Pro Glu Asn 



acc aac cgc ctc atg cag gtg tgc tgc gac cag aga cac gac ccc ggg 963
Thr Asn Arg Leu Met Gln Val Trp Cys Asp Gln Arg His Asp Pro Gly
300 305 310

ggc tgg ace gtc ate cag aga egc ctg gat ggc tct gtt aac ttc ttc 1011 

Gly Trp Thr Val lie Gin Arg Arg Leu Asp Gly Ser Val Asn Phe Phe 

315 320 325 330

ana aac tac aaa aca tac aaa caa aqq tt. t. gga aac att gat qgc gaa 1059



tac aaa etc ctg gtg acc atg gag gac tgg tec ggc egc aaa gtc ttt 1155 
Tyr Lys Leu Leu Val Thr Met Glu Asp Trp Ser Gly Arg Lys Val Phe 

365 370 375 

gca gaa tac gee agt ttc cgc ctg gaa cct gag age gag tat tat aag 1203 
Ala Glu Tyr Ala Ser Phe Arg Leu Glu Pro Glu Ser Glu Tyr Tyr Lys 
380 385 390

ctg egg ctg ggg egc tac cat ggc aat gcg ggt gac tec ttt aca tgg 1251 
Leu Arg Leu Gly Arg Tyr His Gly Asn Ala Gly Asp Ser Phe Thr Trp 
395 400 405 410 

cac aac ggc aag cag ttc acc acc ctg gac aga gat cat gat gtc tac 1299

His Asn Gly Lys Gin Phe Thr Thr Leu Asp Arg Asp His Asp Val Tyr 
415 420 425 

aca gga aac tgt gcc cac tac cag aag gga ggc tgg tgg tat aac gcc 1347
Thr Gly Asn Cys Ala His Tyr Gln Lys Gly Gly Trp Trp Tyr Asn Ala
430 435 440

tgt gec cac tec aac etc aac ggg gtc tgg tac cgc ggg ggc cat tac 1395 
Cys Ala His Ser Asn Leu Asn Gly Val Trp Tyr Arg Gly Gly His Tyr 
445 450 455 

egg age cgc tac cag gac gga gtc tac tgg get gag ttc cga gga ggc 1443 
Arg Ser Arg Tyr Gin Asp Gly Val Tyr Trp Ala Glu Phe Arg Gly Gly 
460 465 470 

tct tac tea etc aag aaa gtg gtg atg atg ate cga ccg aac ccc aac 1491 
Ser Tyr Ser Leu Lys Lys Val Val Met Met lie Arg Pro Asn Pro Asn 
475 480 485 490 

acc ttc cac taa gccagctccc cctcc 1518 
Thr Phe His 
<220>

<221> gene

<222> (1) . . (493)

<223> Angiopoietin-like 2 (ANGPTL2), protein

<220>

<221> SIGNAL



<220>

<221> DOMAIN

<222> (76) . . (115)

<223> Coiled Coil (potential)



<220>

<221> DOMAIN

<222> (152) . . (206)

<223> Coiled Coil (potential)



<220>

<221> DOMAIN

<222> (438) . . (450)

<223> Fibrinogen C-terminal



<220>

<221> CARBOHYD

<222> (164) . . (164)

<223> N-linked (GLCNAC...) (potential)



<220> 

<221> CARBOHYD 

<222> (192) . . (192) 

<223> N-linked (GLCNAC...) (potential)



<400> 8

Met Arg Pro Leu Cys Val Thr Cys Trp Trp Leu Gly Leu Leu Ala Ala
1 5 10 15



Met Gly Ala Val Ala Gly Gln Glu Asp Gly Phe Glu Gly Thr Glu Glu
20 25 30



Gly Ser Pro Arg Glu Phe lie Tyr Leu Asn Arg Tyr Lys Arg Ala Gly 
35 40 45 



Glu Ser Gln Asp Lys Cys Thr Tyr Thr Phe Ile Val Pro Gln Gln Arg



Val Thr Gly Ala lie Cys Val Asn Ser Lys Glu Pro Glu Val Leu Leu 
65 70 75 80 



Glu Asn Arg Val His Lys Gln Glu Leu Glu Leu Leu Asn Asn Glu Leu
85 90 95



Asp Gly Gly Ile Val Ser Glu Val Lys Leu Leu Arg Lys Glu Ser Arg

115 120 125 



Asn Met Asn Ser Arg Val Thr Gin Leu Tyr Met Gin Leu Leu His Glu 
130 135 140 



Ile Ile Arg Lys Arg Asp Asn Ala Leu Glu Leu Ser Gln Leu Glu Asn
145 150 155 160 



Arg Ile Leu Asn Gln Thr Ala Asp Met Leu Gln Leu Ala Ser Lys Tyr
165 170 175 



Lys Asp Leu Glu His Lys Tyr Gin His Leu Ala Thr Leu Ala His Asn 
180 185 190 



Gln Ser Glu Ile Ile Ala Gln Leu Glu Glu His Cys Gln Arg Val Pro
195 200 205



Ser Ala Arg Pro Val Pro Gin Pro Pro Pro Ala Ala Pro Pro Arg Val 
210 215 220 



Tyr Gln Pro Pro Thr Tyr Asn Arg Ile Ile Asn Gln Ile Ser Thr Asn
225 230 235 240 



Glu lie Gin Ser Asp Gin Asn Leu Lys Val Leu Pro Pro Pro Leu Pro 
245 250 255 



Thr Met Pro Thr Leu Thr Ser Leu Pro Ser Ser Thr Asp Lys Pro Ser 
260 265 270 



Gly Pro Trp Arg Asp Cys Leu Gln Ala Leu Glu Asp Gly His Asp Thr
275 280 285



Ser Ser Ile Tyr Leu Val Lys Pro Glu Asn Thr Asn Arg Leu Met Gln
290 295 300



Val Trp Cys Asp Gin Arg His Asp Pro Gly Gly Trp Thr Val lie Gin 



340 345 350 



Asn lie Tyr Trp Leu Thr Asn Gin Gly Asn Tyr Lys Leu Leu Val Thr 

355 360 365 



Met Glu Asp Trp Ser Gly Arg Lys Val Phe Ala Glu Tyr Ala Ser Phe 

370 375 380



Arg Leu Glu Pro Glu Ser Glu Tyr Tyr Lys Leu Arg Leu Gly Arg Tyr 
385 390 395 400 



His Gly Asn Ala Gly Asp Ser Phe Thr Trp His Asn Gly Lys Gin Phe 
405 410 415 



Thr Thr Leu Asp Arg Asp His Asp Val Tyr Thr Gly Asn Cys Ala His 
420 425 430



Tyr Gin Lys Gly Gly Trp Trp Tyr Asn Ala Cys Ala His Ser Asn Leu 
435 440 445 



Asn Gly Val Trp Tyr Arg Gly Gly His Tyr Arg Ser Arg Tyr Gin Asp 
450 455 460 



Gly Val Tyr Trp Ala Glu Phe Arg Gly Gly Ser Tyr Ser Leu Lys Lys 
465 470 475 480



Val Val Met Met lie Arg Pro Asn Pro Asn Thr Phe His 
485 490 



<210> 9

<211> 2133

<212> DNA

<213> Homo sapiens



<221> gene

<222> (1) . . (2133)

<223> Human SPARC/osteonectin mRNA

<220>

<221> CDS



atg agg gcc tgg ate ttc ttt etc ctt tgc ctg gec ggg agg gec ttg 105 

Met Arg Ala Trp lie Phe Phe Leu Leu Cys Leu Ala Gly Arg Ala Leu 

1 5 10 15

gca gcc cct cag caa gaa gcc ^tg cct gat gag aca gag gtg gtg gaa 153 

Ala Ala Pro Gin Gin Glu Ala Leu Pro Asp Glu Thr Glu Val Val Glu 
20 25 30 

gaa act gtg gca gag gtg act gag gta tct gtg gga get aat cct gtc 201 

Glu Thr Val Ala Glu Val Thr Glu Val Ser Val Gly Ala Asn Pro Val 

35 40 45 

cag gtg gaa gta gga gaa ttt gat gat ggt gca gag gaa acc gaa gag 249

Gin Val Glu Val Gly Glu Phe Asp Asp Gly Ala Glu Glu Thr Glu Glu 

50 55 60 

gag gtg gtg gcg gaa aat ccc tgc cag aac cac cac tgc aaa cac ggc 297

Glu Val Val Ala Glu Asn Pro Cys Gln Asn His His Cys Lys His Gly

65 70 75 80

aag gtg tgc gag ctg gat gag aac aac acc ccc atg tgc gtg tgc cag 345 

Lys Val Cys Glu Leu Asp Glu Asn Asn Thr Pro Met Cys Val Cys Gin 

85 90 95 

gac ccc acc age tgc cca gcc ccc att ggc gag ttt gag aag gtg tgc 393 

Asp Pro Thr Ser Cys Pro Ala Pro lie Gly Glu Phe Glu Lys Val Cys 
100 105 110 

age aat gac aac aag acc ttc gac tct tec tgc cac ttc ttt gcc aca 441 

Ser Asn Asp Asn Lys Thr Phe Asp Ser Ser Cys His Phe Phe Ala Thr 

115 120 125 

aag tgc acc ctg gag ggc acc aag aag ggc cac aag etc cac ctg gac 489 

Lys Cys Thr Leu Glu Gly Thr Lys Lys Gly His Lys Leu His Leu Asp 

130 135 140 

tac ate ggg cct tgc aaa tac ate ccc cct tgc ctg gac tct gag ctg 537 

Tyr lie Gly Pro Cys Lys Tyr lie Pro Pro Cys Leu Asp Ser Glu Leu 

145 150 155 160 

acc gaa ttc ccc ctg cgc atg ::cg gac tgg ctc aag aac ate ctg gtc 585

Thr Glu Phe Pro Leu Arg Met Arg Asp Trp Leu Lys Asn Val Leu Val

acc ctg tat gag agg gat gag gac aac aac ctt ctg act gag aag cag 633

Thr Leu Tyr Glu Arg Asp Glu Asp Asn Asn Leu Leu Thr Glu Lys Gin 
180 185 190 

aag ctg cgg gtg aag aag atc cat gag aat gag aag cgc ctg gag gca 681

Lys Leu Arg Val Lys Lys Ile His Glu Asn Glu Lys Arg Leu Glu Ala

195 200 205



Asn Met Tyr lie Phe Pro Val His Trp Gin Phe Gly Gin Leu Asp Gin 
225 230 235 240

cac cc2 att gac ggg tac etc tec cac acc gag ctg get cca ctg cgt 825 

His Pro lie Asp Gly Tyr Leu Ser His Thr Glu Leu Ala Pro Leu Arg 
245 250 255 

get cec etc ate cce atg gag eat tge ace acc egc ttt ttc gag acc 873 

Ala Pro Leu lie Pro Met Glu His Cys Thr Thr Arg Phe Phe Glu Thr 
260 265 270

tgt gac ctg gac aat gac aag tac atc gcc ctg gat gag tgg gcc ggc 921

Cys Asp Leu Asp Asn Asp Lys Tyr Ile Ala Leu Asp Glu Trp Ala Gly
275 280 285 

tge ttc ggc ate aag cag aag gat ate gae aag gat ett gtg ate taa 969 

Cys Phe Gly lie Lys Gin Lys Asp lie Asp Lys Asp Leu Val lie 

290 295 300 

atccaetcet tceacagtac eggattctet etttaacect ccccttcgtg tttcceccaa 1029 

tgtttaaaat gtttggatgg tttgttgttc tgcetggaga eaaggtgcta aeatagattt 1089 

aagtgaatac attaaeggtg ctaaaaatga aaattctaac ecaagacatg aeattcttag 1149

etgtaaetta aetattaagg ccttttecac aegeattaat agteecattt ttctettgee 1209 

atttgtaget ttgeccattg tcttattggc aeatgggtgg acaeggatct getgggctct 1269 

geettaaaea caeattgeag cttcaacttt tetetttagt gttetgtttg aaactaatac 1329 

ttacegagtc agaetttgtg ttcatttcat tteagggtct tggctgcetg tgggcttccc 1389

eaggtggeet ggaggtggge aaagggaagt aaeagacaca egatgttgtc aaggatggtt 1449

ttgggactag aggrtcagtg gtgggagaga teeetgeaga atceaccaac cagaaegtgg 1509

tttgeetgag getgtaactg agagaaagat tetggggetg tcttatgaaa atatagacat 1569

tetcaeataa gcc:agttca teaccattte ctc:tttacc ttteagtgca gtttettttc 1629

aeattagqrt gttggtteaa acttttggga gcarggartg tcagttctct gggaagtggt 1689

cagcgcatL-c tgeagggctt ctectcctet gt^ttttgga gaae^aggge tettctcagg 1749

gg^tetaggg aetgecagge tgtttcagee aggaaggeea aaat:aagag tgagatgtag 1809

aaagttgtaa aatagaaaaa gtggagttgg tgaatcggtt gttetttect cacatttgga 1869 

tgat tgt cat aaggt tt t ta gcat gt t cet cct: tt t. ct t. c accetcccct t. tgt t. ::t tct 192 9 



aaaaaaaaaa aaaaaaaaaa aaaa 



<210> 10

<211> 303

<212> PRT

<213> Homo sapiens



<221> SIGNAL

<222> (1) . . (17)



<220>

<221> gene

<222> (1) . . (303)

<223> SPARC, osteonectin



<221> chain

<222> (18) . . (303)

<223> SPARC

<220>

<221> DOMAIN

<222> (22) . . (69)

<223> ASP/GLU-RICH (ACIDIC, BINDS CALCIUM)



<221> DOMAIN
<222> (94) . . (149)
<223> KAZAL-LIKE

<220>

<221> DOMAIN

<222> (185) . . (221)

<223> ALPHA-HELIX CONFORMATION (PROBABLE)

<220>

<221> CA_BIND

<222> (274) . . (285)

<223> EF-HAND



<220>

<221> CARBOHYD

<222> (116) . . (116)

<223> N-LINKED (GLCNAC...) (PROBABLE)



<220> 

<221> DISULFID 
<222> (95) . . (130)
<223>

<220>

<221> DISULFID

<222> (101) . . (123)

<223>



<220>

<221> DISULFID
<222> (112) . . (149)
<223>

<220> 

<221> DISULFID 
<222> (155) . . (265)
<223> 

<220> 

<221> DISULFID 
<222> (273) . . (289) 
<223> 



<400> 10

Met Arg Ala Trp Ile Phe Phe Leu Leu Cys Leu Ala Gly Arg Ala Leu
1 5 10 15



Ala Ala Pro Gln Gln Glu Ala Leu Pro Asp Glu Thr Glu Val Val Glu

20 25 30 



Glu Thr Val Ala Glu Val Thr Glu Val Ser Val Gly Ala Asn Pro Val 
35 40 45 



Gln Val Glu Val Gly Glu Phe Asp Asp Gly Ala Glu Glu Thr Glu Glu



Glu Val Val Ala Glu Asn Pro Cys Gln Asn His His Cys Lys His Gly
65 70 75 80



Lys Val Cys Glu Leu Asp Glu Asn Asn Thr Pro Met Cys Val Cys Gln
85 90 95 



Ser Asn Asp Asn Lys Thr Phe Asp Ser Ser Cys His Phe Phe Ala Thr 
115 120 125 



Lys Cys Thr Leu Glu Gly Thr Lys Lys Gly His Lys Leu His Leu Asp 
130 135 140 



Tyr lie Gly Pro Cys Lys Tyr lie Pro Pro Cys Leu Asp Ser Glu Leu 
145 150 155 160 



Thr Glu Phe Pro Leu Arg Met Arg Asp Trp Leu Lys Asn Val Leu Val 
165 170 175 



Thr Leu Tyr Glu Arg Asp Glu Asp Asn Asn Leu Leu Thr Glu Lys Gin 
180 185 190 



Lys Leu Arg Val Lys Lys lie His Glu Asn Glu Lys Arg Leu Glu Ala 
195 200 205 



Gly Asp His Pro Val Glu Leu Leu Ala Arg Asp Phe Glu Lys Asn Tyr 
210 215 220 



Asn Met Tyr lie Phe Pro Val His Trp Gin Phe Gly Gin Leu Asp Gin 
225 230 235 240 



His Pro lie Asp Gly Tyr Leu Ser His Thr Glu Leu Ala Pro Leu Arg 
245 250 255 



Ala Pro Leu lie Pro Met Glu His Cys Thr Thr Arg Phe Phe Glu Thr 
260 265 270 



Cys Asp Leu Asp Asn Asp Lys Tyr Ile Ala Leu Asp Glu Trp Ala Gly
275 280 285



Cys Phe Gly Ile Lys Gln Lys Asp Ile Asp Lys Asp Leu Val Ile
290 295 300



<210> 11

<211> 4586

<212> DNA



<220>

<221> CDS

<222> (195) . . (4367)

<223>



<300>

<301> Park, M., Dean, M., Kaul, K., Braun, M.J., Gonda, M.A. and Vande Woude, G.

<302> Sequence of MET protooncogene cDNA has features characteristic of the
tyrosine kinase family of growth-factor receptors

<303> Proc. Natl. Acad. Sci. U.S.A.

<304> 84 

<305> 18
<306> 6379-6383 
<307> 1987 
<309> 

<313> (1) . . (4586)



<400> 11

gaattccgcc ctcgccgccc gcggcgcccc gagcgctttg tgagcagatg cggagccgag 60 

tggagggcgc gagccagatg cggggcgaca gctgacttgc tgagaggagg cggggaggcg 120 

cggagcgcgc gtgtggtcct tgcgccgctg acttctccac tggttcctgg gcaccgaaag 180 

ataaacctct cata atg aag gcc ccc get gtg ctt gca cct ggc ate etc 230 

Met Lys Ala Pro Ala Val Leu Ala Pro Gly lie Leu 
1 5 10 

gtg etc ctg ttt acc ttg gtg cag agg age aat ggg gag tgt aaa gag 278 

Val Leu Leu Phe Thr Leu Val Gin Arg Ser Asn Gly Glu Cys Lys Glu 
15 20 25 

gca cta gca aag tcc gag atg aat gtg aat atg aag tat cag ctt ccc 326

Ala Leu Ala Lys Ser Glu Met Asn Val Asn Met Lys Tyr Gln Leu Pro

30 35 40

aac ttc acc gcg gaa aca ccc atc cag aat gtc att cta cat gag cat 374

Asn Phe Thr Ala Glu Thr Pro Ile Gln Asn Val Ile Leu His Glu His

45 50 55 60

cac att ttc ctt ggt gcc act aac tac att tat gtt cta aat gag gaa 422

His Ile Phe Leu Gly Ala Thr Asn Tyr Ile Tyr Val Leu Asn Glu Glu
65 70 75

gac ctt cag aag gtt gct gag tac aag act ggg cct gtg ctg gaa cac 470

Asp Leu Gln Lys Val Ala Glu Tyr Lys Thr Gly Pro Val Leu Glu His
80 85 90



Gly Gly Val Trp Lys Asp Asn lie Asn Met Ala Leu Val Val Asp Thr 
110 115 120 

tac tat gat gat caa etc att age tgt ggc age gtc aac aga ggg ace 614 

Tyr Tyr Asp Asp Gin Leu lie Ser Cys Gly Ser Val Asn Arg Gly Thr 

125 130 135 140 

tgc cag cga cat gtc ttt ccc cac aat cat act get gac ata eag teg 662 

Cys Gin Arg His Val Phe Pro His Asn His Thr Ala Asp lie Gin Ser 

145 150 155 

gag gtt cac tgc ata ttc tec cca cag ata gaa gag ccc age cag tgt 710 

Glu Val His Cys lie Phe Ser Pro Gin lie Glu Glu Pro Ser Gin Cys 
160 165 170 

cct gac tgt gtg gtg age gee ctg gga gee aaa gtc ctt tea tct gta 758 

Pro Asp Cys Val Val Ser Ala Leu Gly Ala Lys Val Leu Ser Ser Val 
175 180 185 

aag gac egg ttc ate aae ttc ttt gta ggc aat acc ata aat tct tct 806 

Lys Asp Arg Phe lie Asn Phe Phe Val Gly Asn Thr lie Asn Ser Ser 
190 195 200 

tat ttc cca gat cat cca ttg cat teg ata tea gtg aga agg eta aag 854 

Tyr Phe Pro Asp His Pro Leu His Ser lie Ser Val Arg Arg Leu Lys 

205 210 215 220 

gaa acg aaa gat ggt ttt atg ttt ttg acg gac cag tec tac att gat 902 

Glu Thr Lys Asp Gly Phe Met Phe Leu Thr Asp Gin Ser Tyr lie Asp 

225 230 235 

gtt tta cct gag ttc aga gat tct tac ccc att aag tat gtc eat gee 950 

Val Leu Pro Glu Phe Arg Asp Ser Tyr Pro lie Lys Tyr Val His Ala 
240 245 250 

ttt gaa age aac aat ttt att tac ttc ttg acg gtc caa agg gaa act 998 

Phe Glu Ser Asn Asn Phe lie Tyr Phe Leu Thr Val Gin Arg Glu Thr 
255 260 265 

cta gat gct cag act ttt cac aca aga ata atc agg ttc tgt tcc ata 1046

Leu Asp Ala Gln Thr Phe His Thr Arg Ile Ile Arg Phe Cys Ser Ile

270 275 280

aac tct gga ttg cat tcc tac atg gaa atg cct ctg gag tgt att ctc 1094

Asn Ser Gly Leu His Ser Tyr Met Glu Met Pro Leu Glu Cys Ile Leu

285 290 295 300

aca gaa aag aga aaa aag aga tcc aca aag aag gaa gtg ttt aat ata 1142

Thr Glu Lys Arg Lys Lys Arg Ser Thr Lys Lys Glu Val Phe Asn Ile

305 310 315



335 340 345 

age aag cca gat tct gec gaa cca atg gat cga tct gec atg tgt gca 1286 

Ser Lys Pro Asp Ser Ala Glu Pro Met Asp Arg Ser Ala Met Cys Ala 

350 355 360

ttc cct ate aaa tat gt: aac gac ttc ttc aac aag ate gtc aae aaa 1334 

Phe Pro lie Lys Tyr Val Asn Asp Phe Phe Asn Lys lie Val Asn Lys 

365 370 375 380

aa: aat gtg aga tgt ete eag cat ttt tac gga ccc aat cat gag cac 1382 

Asn Asn Val Arg Cys Leu Gin His Phe Tyr Gly Pro Asn His Glu His 

385 390 395 

tgc ttt aat agg aca ctt ctg aga aat tca tca ggc tgt gaa gcg cgc 1430

Cys Phe Asn Arg Thr Leu Leu Arg Asn Ser Ser Gly Cys Glu Ala Arg 

400 405 410 

cgt gat gaa tat cga aca gag ttt acc aca gct ttg cag cgc gtt gac 1478

Arg Asp Glu Tyr Arg Thr Glu Phe Thr Thr Ala Leu Gin Arg Val Asp 

415 420 425 

tta ttc atg ggt caa ttc age gaa gtc etc tta aca tct ata tec ace 1526 

Leu Phe Met Gly Gin Phe Ser Glu Val Leu Leu Thr Ser lie Ser Thr 

430 435 440 

ttc att aaa gga gac etc acc ata get aat ctt ggg aca tea gag ggt 1574 

Phe lie Lys Gly Asp Leu Thr lie Ala Asn Leu Gly Thr Ser Glu Gly 

445 450 455 460

cgc ttc atg eag gtt gtg gtt tct cga tea gga cca tea acc ect eat 1622 

Arg Phe Met Gin Val Val Val Ser Arg Ser Gly Pro Ser Thr Pro His 

465 470 475 

gtg aat ttt etc ctg gac tec eat oca gtg tct cca gaa gtg att gtg 1670 

Val Asn Phe Leu Leu Asp Ser His Pro Val Ser Pro Glu Val lie Val 

480 485 490 

gag cat aca tta aac caa aat ggc tac aca ctg gtt ate act ggg aag 1718 

Glu His Thr Leu Asn Gin Asn Gly Tyr Thr Leu Val lie Thr Gly Lys 

495 500 505

, i : ate acg aag r • cca ttg aat gg:' ' : .: gac t gc =iga cat 1 t c 'a : . " 

Lys lie Thr Lys lie Pro Leu Asn Gly Leu Gly Cys Arg His Phe Gin 

510 515 520

tec tge agt caa tgc etc tct gec cca ccc ttt gtt cag tgt ggc tgg 1814 

Ser Cys Ser Gin Cys Leu Ser Ala Pro Pro Phe Val Gin Cys Gly Trp 

525 530 535 540 

tgc cac gac aaa tgt gtg cga teg gag gaa tgc ctg age ggg aca egg 1862



gca ccc ctt gaa gga ggg aca agg ctg acc ata tgt ggc tgg gac ttt 1958 

Ala Pro Leu Glu Gly Gly Thr Arg Leu Thr lie Cys Gly Trp Asp Phe 
575 580 585 

gga ttt egg agg aat aat aaa ttt gat tta aag aaa act aga gtt etc 2006 

Gly Phe Arg Arg Asn Asn Lys Phe Asp Leu Lys Lys Thr Arg Val Leu 

590 595 600

ctt gga aat gag age tgc acc ttg act tta agt gag age acg atg aat 2054 

Leu Gly Asn Glu Ser Cys Thr Leu Thr Leu Ser Glu Ser Thr Met Asn 

605 610 615 620

aca ttg aaa tgc aca gtt ggt cct gee atg aat aag cat ttc aat atg 2102 

Thr Leu Lys Cys Thr Val Gly Pro Ala Met Asn Lys His Phe Asn Met 

625 630 635 

tec ata att att tea aat ggc cac ggg aca aca caa tac agt aca ttc 2150 

Ser lie lie lie Ser Asn Gly His Gly Thr Thr Gin Tyr Ser Thr Phe 
640 645 650 

tec tat gtg gat cct gta ata aca agt att teg ccg aaa tac ggt cct 2198 

Ser Tyr Val Asp Pro Val lie Thr Ser lie Ser Pro Lys Tyr Gly Pro 
655 660 665 

atg get ggt ggc act tta ctt act tta act gga aat tac eta aac agt 2246 

Met Ala Gly Gly Thr Leu Leu Thr Leu Thr Gly Asn Tyr Leu Asn Ser 
670 675 680 

ggg aat tct aga cac att tea att ggt gga aaa aca tgt act tta aaa 2294 

Gly Asn Ser Arg His lie Ser lie Gly Gly Lys Thr Cys Thr Leu Lys 

685 690 695 700 

agt gtg tea aac agt att ctt gaa tgt tat acc cca gee caa acc att 2342 

Ser Val Ser Asn Ser lie Leu Glu Cys Tyr Thr Pro Ala Gin Thr lie 

705 710 715 

tea act gag ttt get gtt aaa ttg aaa att gac tta gee aac cga gag 2390 

Ser Thr Glu Phe Ala Val Lys Leu Lys lie Asp Leu Ala Asn Arg Glu 

720 725 730

aca agc atc ttc agt tac cgt gaa gat ccc att gtc tat gaa att cat 2438

Thr Ser Ile Phe Ser Tyr Arg Glu Asp Pro Ile Val Tyr Glu Ile His
735 740 745

cca acc aaa tct ttt att agt ggt ggg ag: aca ata aca ggt gtt ggg 2486 

Pro Thr Lys Ser Phe Ile Ser Gly Gly Ser Thr Ile Thr Gly Val Gly
750 755 760

aaa aac ctg aat tca gtt agt gtc ccg aga atg gtc ata aat gtg cat 2534

Lys Asn Leu Asn Ser Val Ser Val Pro Arg Met Val Ile Asn Val His



gag ata ate tgt tgt acc act cct tec ctg caa cag ctg aat ctg caa 2630 

Glu lie lie Cys Cys Thr Thr Pro Ser Leu Gin Gin Leu Asn Leu Gin 

800 805 810

etc ccc ctg aaa acc aaa gec ttt ttc atg tta gat ggg ate ctt tec 2678 

Leu Pro Leu Lys Thr Lys Ala Phe Phe Met Leu Asp Gly lie Leu Ser 
815 820 825 

aaa tac ttt gat etc att tat gta cat aat cct gtg ttt aag cct ttt 2726 

Lys Tyr Phe Asp Leu lie Tyr Val His Asn Pro Val Phe Lys Pro Phe 

830 835 840 

gaa aag cca gtg atg ate tea atg ggc aat gaa aat gta ctg gaa att 2774 

Glu Lys Pro Val Met lie Ser Met Gly Asn Glu Asn Val Leu Glu lie 
845 850 855 860

aag gga aat gat att gac cct gaa gca gtt aaa ggt gaa gtg tta aaa 2822 

Lys Gly Asn Asp lie Asp Pro Glu Ala Val Lys Gly Glu Val Leu Lys 
865 870 875 

gtt gga aat aag age tgt gag aat ata cac tta cat tct gaa gee gtt 2870 

Val Gly Asn Lys Ser Cys Glu Asn lie His Leu His Ser Glu Ala Val 
880 885 890 

tta tgc acg gtc ccc aat gac ctg ctg aaa ttg aac age gag eta aat 2918 

Leu Cys Thr Val Pro Asn Asp Leu Leu Lys Leu Asn Ser Glu Leu Asn 

895 900 905 

ata gag tgg aag caa gca att tct tea acc gtc ctt gga aaa gta ata 2966 

lie Glu Trp Lys Gin Ala lie Ser Ser Thr Val Leu Gly Lys Val lie 

910 915 920 

gtt caa cca gat cag aat ttc aca gga ttg att get ggt gtt gtc tea 3014 

Val Gin Pro Asp Gin Asn Phe Thr Gly Leu lie Ala Gly Val Val Ser 
925 930 935 940 

ata tea aca gca ctg tta tta eta ctt ggg ttt ttc ctg tgg ctg aaa 3062 

lie Ser Thr Ala Leu Leu Leu Leu Leu Gly Phe Phe Leu Trp Leu Lys 
945 950 955 

aag aga aag caa att aaa gat ctg ggc agt gaa tta gtt car tac gat 3110

Lys Arg Lys Gln Ile Lys Asp Leu Gly Ser Glu Leu Val Arg Tyr Asp

960 965 970

gca aga gta cac act cct cat ttg gat agg ctt gta agt gcc cga agt 3158

Ala Arg Val His Thr Pro His Leu Asp Arg Leu Val Ser Ala Arg Ser 
975 980 985 

gta agc cca act aca gaa atg gtt tca aat gaa tct gta gac tac cga 3206

Val Ser Pro Thr Thr Glu Met Val Ser Asn Glu Ser Val Asp Tyr Arg

990 995 1000




Ser Cys Arg Gln Val Gln Tyr Pro Leu Thr Asp Met Ser Pro Ile
1020 1025 1030

cta act agt ggg gac ten gat ata tcc agt cca tta ctg caa aat 3341
Leu Thr Ser Gly Asp Ser Asp Ile Ser Ser Pro Leu Leu Gln Asn
1035 1040 1045

act gtc cac att gac ctc agt gct cta aat cca gag ctg gtc cag 3386
Thr Val His Ile Asp Leu Ser Ala Leu Asn Pro Glu Leu Val Gln
1050 1055 1060

gca gtg cag cat gta gtg att ggg ccc agt agc ctg att gtg cat 3431
Ala Val Gln His Val Val Ile Gly Pro Ser Ser Leu Ile Val His
1065 1070 1075

ttc aat gaa gtc ata gga aga ggg cat ttt ggt tgt gta tat cat 3476
Phe Asn Glu Val Ile Gly Arg Gly His Phe Gly Cys Val Tyr His
1080 1085 1090

ggg act ttg ttg gac aat gat ggc aag aaa att cac tgt gct gtg 3521
Gly Thr Leu Leu Asp Asn Asp Gly Lys Lys Ile His Cys Ala Val
1095 1100 1105

aaa tcc ttg aac aga atc act gac ata gga gaa gtt tcc caa ttt 3566
Lys Ser Leu Asn Arg Ile Thr Asp Ile Gly Glu Val Ser Gln Phe
1110 1115 1120

ctg acc gag gga atc atc atg aaa gat ttt agt cat ccc aat gtc 3611
Leu Thr Glu Gly Ile Ile Met Lys Asp Phe Ser His Pro Asn Val
1125 1130 1135

ctc tcg ctc ctg gga atc tgc ctg cga agt gaa ggg tct ccg ctg 3656
Leu Ser Leu Leu Gly Ile Cys Leu Arg Ser Glu Gly Ser Pro Leu
1140 1145 1150

gtg gtc cta cca tac atg aaa cat gga gat ctt cga aat ttc att 3701
Val Val Leu Pro Tyr Met Lys His Gly Asp Leu Arg Asn Phe Ile
1155 1160 1165

cga aat gag act cat aat cca act gta aaa gat ctt att ggc ttt 3746
Arg Asn Glu Thr His Asn Pro Thr Val Lys Asp Leu Ile Gly Phe
1170 1175 1180

ggt ctt caa gta gcc aaa ggc atg aaa tat ctt gca agc aaa aag 3791
Gly Leu Gln Val Ala Lys Gly Met Lys Tyr Leu Ala Ser Lys Lys
1185 1190 1195

ttt gtc cac aga gac ttg gct gca aga aac tgt atg ctg gat gaa 3836
Phe Val His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Asp Glu
1200 1205 1210



1230 1235 1240 

ctg cca gtg aag tgg atg get ttg gaa agt ctg caa act caa aag 3971 

Leu Pro Val Lys Trp Met Ala Leu Glu Ser Leu Gin Thr Gin Lys 

1245 1250 1255 

ttt acc acc aag tea gat gtg tgg tec ttt ggc gtc gtc etc tgg 4016 

Phe Thr Thr Lys Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp 

1260 1265 1270 

gag ctg atg aca aga gga gcc cca cct tat cct gac gta aac acc 4061

Glu Leu Met Thr Arg Gly Ala Pro Pro Tyr Pro Asp Val Asn Thr

1275 1280 1285

ttt gat ata act gtt tac ttg ttg caa ggg aga aga ctc cta caa 4106

Phe Asp Ile Thr Val Tyr Leu Leu Gln Gly Arg Arg Leu Leu Gln

1290 1295 1300 

ecc gaa tac tgc cca gac ccc tta tat gaa gta atg eta aaa tgc 4151 

Pro Glu Tyr Cys Pro Asp Pro Leu Tyr Glu Val Met Leu Lys Cys 

1305 1310 1315 

tgg eac cct aaa gee gaa atg cgc cca tec ttt tct gaa ctg gtg 4196 

Trp His Pro Lys Ala Glu Met Arg Pro Ser Phe Ser Glu Leu Val 

1320 1325 1330 

tee egg ata tea gcg ate ttc tct act ttc att ggg gag eac tat 4241 

Ser Arg lie Ser Ala lie Phe Ser Thr Phe lie Gly Glu His Tyr 

1335 1340 1345 

gtc eat gtg aac get act tat gtg aac gta aaa tgt gtc get ccg 4286 

Val His Val Asn Ala Thr Tyr Val Asn Val Lys Cys Val Ala Pro 

1350 1355 1360 

tat cct tct ctg ttg tea tea gaa gat aac get gat gat gag gtg 4331 

Tyr Pro Ser Leu Leu Ser Ser Glu Asp Asn Ala Asp Asp Glu Val 

1365 1370 1375

gac aca cga cca gee tec ttc tgg gag aca tea tag tgctagtaet 4377 

Asp Thr Arg Pro Ala Ser Phe Trp Glu Thr Ser 

1380 1385 1390

a:gtcaaagc aacaglccac act :.. t. g:cca atggtttttt cactgcctga cctttaaaag 44 3" 

gecategata ttctttgetc ettgecaaat tgcactatta ataggacttg tattgttatt 4497

taaattactg gattctaagg aatttcttat ctgacagagc atcagaacca gaggettggt 4557 



cccacaggcc agggaccaat gcgctgcag 4586





<221> SIGNAL

<222> (1) . . (24)




<220>

<221> gene

<222> (1) . . (1390)

<223> C-Met Proto Oncogene

<220>

<221> CHAIN

<222> (25) . . (1390)

<223> HEPATOCYTE GROWTH FACTOR RECEPTOR
<220>
<221> DOMAIN
<222> (25) . . (932)
<223> EXTRACELLULAR (POTENTIAL)

<220>
<221> TRANSMEM
<222> (933) . . (955)
<223> POTENTIAL

<220>
<221> DOMAIN
<222> (956) . . (1390)
<223> CYTOPLASMIC (POTENTIAL)

<220>
<221> DOMAIN
<222> (55) . . (500)
<223> SEMA

<220>
<221> DOMAIN
<222> (1078) . . (1345)
<223> PROTEIN KINASE

<220>
<221> SITE
<222> (1009) . . (1010)
<223> Breakpoint for Translocation to Form TPR-Met Oncogene

<220>
<221> NP_BIND
<222> (1084) . . (1092)
<223> ATP (BY SIMILARITY)

<220>
<221> BINDING
<222>
<223> BY SIMILARITY

<220>
<221> SITE
<222> (307) . . (308)
<223> CLEAVAGE (POTENTIAL)

<220>
<221> MOD_RES
<222> (1235) . . (1235)
<223> PHOSPHORYLATION (AUTO-)

<220>
<221> CARBOHYD
<222> (45) . . (45)
<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>
<221> CARBOHYD
<222> (106) . . (106)
<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (149) . . (149)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (202) . . (202)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220> 

<221> CARBOHYD 

<222> (399) . . (399) 

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (405) . . (405)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (607) . . (607)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (635) . . (635)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (879) . . (879)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (930) . . (930)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> VARSPLIC

<222> (755) . . (755)

<223> S -> STWWKEPLNIVSFLFCFAS (IN ISOFORM 2)
<220>

<221> VARIANT

<222> (320) . . (320)

<223> A -> V



<220>

<221> VARIANT

<222> (1131) . . (1131)

<223> M -> T (IN HPRC; GERMLINE MUTATION)



<220> 

<221> VARIANT
<222> (1188) . . (1188)

<223> V -> L (IN HPRC; GERMLINE MUTATION)

<220>

<221> VARIANT

<222> (1195) . . (1195)

<223> L -> V (IN HPRC; GERMLINE MUTATION)

<220>

<221> VARIANT

<222> (1220) . . (1220)

<223> V -> I (IN HPRC; GERMLINE MUTATION)



<220>

<221> VARIANT

<222> (1228) . . (1228)

<223> D -> N (IN HPRC; GERMLINE MUTATION)

<220>

<221> VARIANT

<222> (1228) . . (1228)

<223> D -> H (IN HPRC; GERMLINE MUTATION)

<220>

<221> VARIANT

<222> (1230) . . (1230)

<223> Y -> H (IN HPRC; GERMLINE MUTATION)

<220>

<221> VARIANT

<222> (1250) . . (1250)

<223> M -> T (IN HPRC; GERMLINE MUTATION)



<400> 12 

Met Lys Ala Pro Ala Val Leu Ala Pro Gly Ile Leu Val Leu Leu Phe
1 5 10 15



Thr Leu Val Gin Arg Ser Asn Gly Glu Cys Lys Glu Ala Leu Ala Lys 

20 25 30 



Ser Glu Met Asn Val Asn Met Lys Tyr Gin Leu Pro Asn Phe Thr Ala 
35 40 45 



Glu Thr Pro lie Gin Asn Val lie Leu His Glu His His lie Phe Leu 
50 55 60 



Gly Ala Thr Asn Tyr lie Tyr Val Leu Asn Glu Glu Asp Leu Gin Lys 
65 70 75 80



Val Ala Glu Tyr Lys Thr Gly Pro Val Leu Glu His Pro Asp Cys Phe 
85 90 95 



Pro Cys Gin Asp Cys Ser Ser Lys Ala Asn Leu Ser Gly Gly Val Trp 
100 105 110 



Lys Asp Asn Ile Asn Met Ala Leu Val Val Asp Thr Tyr Tyr Asp Asp

115 120 125 



Gln Leu Ile Ser Cys Gly Ser Val Asn Arg Gly Thr Cys Gln Arg His
130 135 140 



Val Phe Pro His Asn His Thr Ala Asp Ile Gln Ser Glu Val His Cys
145 150 155 160 



lie Asn Phe Phe Val Gly Asn Thr lie Asn Ser Ser Tyr Phe Pro Asp 
195 200 205



His Pro Leu His Ser lie Ser Val Arg Arg Leu Lys Glu Thr Lys Asp 

210 215 220 



Gly Phe Met Phe Leu Thr Asp Gin Ser Tyr lie Asp Val Leu Pro Glu 
225 230 235 240



Phe Arg Asp Ser Tyr Pro lie Lys Tyr Val His Ala Phe Glu Ser Asn 
245 250 255 



Asn Phe lie Tyr Phe Leu Thr Val Gin Arg Glu Thr Leu Asp Ala Gin 
260 265 270 



Thr Phe His Thr Arg lie lie Arg Phe Cys Ser lie Asn Ser Gly Leu 
275 280 285 



His Ser Tyr Met Glu Met Pro Leu Glu Cys lie Leu Thr Glu Lys Arg 

290 295 300 



Lys Lys Arg Ser Thr Lys Lys Glu Val Phe Asn lie Leu Gin Ala Ala 
305 310 315 320 



Tyr Val Ser Lys Pro Gly Ala Gin Leu Ala Arg Gin lie Gly Ala Ser 
325 330 335 



Leu Asn Asp Asp lie Leu Phe Gly Val Phe Ala Gin Ser Lys Pro Asp 

340 3 45 3 50 



Ala Glu Pre Met Asp Arq Ser Ala Met Cys Al a Phe Pro lie Lyr- 
3 5 5 3 6 0 3 6b 



Tyr Val Asn Asp Phe Phe Asn Lys lie Val Asn Lys Asn Asn Val Arg 
370 375 380 



' ''.'r Leu Gln His Phe Tyr Gly Pro Asn His Glu His Cys Phe Asn Arg



Arg Thr Glu Phe Thr Thr Ala Leu Gln Arg Val Asp Leu Phe Met
420 425 430

Gln Phe Ser Glu Val Leu Leu Thr Ser Ile Ser Thr Phe Ile Lys Gly
435 440 445

Asp Leu Thr Ile Ala Asn Leu Gly Thr Ser Glu Gly Arg Phe Met Gln
450 455 460

Val Val Val Ser Arg Ser Gly Pro Ser Thr Pro His Val Asn Phe Leu
465 470 475 480

Leu Asp Ser His Pro Val Ser Pro Glu Val Ile Val Glu His Thr Leu
485 490 495

Asn Gln Asn Gly Tyr Thr Leu Val Ile Thr Gly Lys Lys Ile Thr Lys
500 505 510

Ile Pro Leu Asn Gly Leu Gly Cys Arg His Phe Gln Ser Cys Ser Gln
515 520 525

Cys Leu Ser Ala Pro Pro Phe Val Gln Cys Gly Trp Cys His Asp Lys
530 535 540

Cys Val Arg Ser Glu Glu Cys Leu Ser Gly Thr Trp Thr Gln Gln Ile
545 550 555 560

Cys Leu Pro Ala Ile Tyr Lys Val Phe Pro Asn Ser Ala Pro Leu Glu
565 570 575



no r :if ■ 



Asn Asn Lys Phe Asp Leu Lys Lys Thr Arg Val Leu Leu Gly Asn Glu
595 600 605

Ser Asn Gly His Gly Thr Thr Gln Tyr Ser Thr Phe Ser Tyr Val Asp
645 650 655

Pro Val Ile Thr Ser Ile Ser Pro Lys Tyr Gly Pro Met Ala Gly Gly
660 665 670

Thr Leu Leu Thr Leu Thr Gly Asn Tyr Leu Asn Ser Gly Asn Ser Arg
675 680 685

His Ile Ser Ile Gly Gly Lys Thr Cys Thr Leu Lys Ser Val Ser Asn
690 695 700

Ser Ile Leu Glu Cys Tyr Thr Pro Ala Gln Thr Ile Ser Thr Glu Phe
705 710 715 720

Ala Val Lys Leu Lys Ile Asp Leu Ala Asn Arg Glu Thr Ser Ile Phe
725 730 735

Ser Tyr Arg Glu Asp Pro Ile Val Tyr Glu Ile His Pro Thr Lys Ser
740 745 750

Phe Ile Ser Gly Gly Ser Thr Ile Thr Gly Val Gly Lys Asn Leu Asn
755 760 765

Ser Val Ser Val Pro Arg Met Val Ile Asn Val His Glu Ala Gly Arg
770 775 780

Asn Phe Thr Val Ala Cys Gln His Arg Ser Asn Ser Glu Ile Ile Cys
785 790 795 800



r o int. .-.e . .•■ . .- -. . ► • . / . . . , . . i . . _ . . y : rue ^ 

820 825 830



Leu Ile Tyr Val His Asn Pro Val Phe Lys Pro Phe Glu Lys Pro Val

865 870 875 880

Ser Cys Glu Asn Ile His Leu His Ser Glu Ala Val Leu Cys Thr Val
885 890 895

Pro Asn Asp Leu Leu Lys Leu Asn Ser Glu Leu Asn Ile Glu Trp Lys
900 905 910

Gln Ala Ile Ser Ser Thr Val Leu Gly Lys Val Ile Val Gln Pro Asp
915 920 925

Gln Asn Phe Thr Gly Leu Ile Ala Gly Val Val Ser Ile Ser Thr Ala
930 935 940

Leu Leu Leu Leu Leu Gly Phe Phe Leu Trp Leu Lys Lys Arg Lys Gln
945 950 955 960

Ile Lys Asp Leu Gly Ser Glu Leu Val Arg Tyr Asp Ala Arg Val His
965 970 975

Thr Pro His Leu Asp Arg Leu Val Ser Ala Arg Ser Val Ser Pro Thr
980 985 990

Thr Glu Met Val Ser Asn Glu Ser Val Asp Tyr Arg Ala Thr Phe Pro
995 1000 1005

Glu Asp Gln Phe Pro Asn Ser Ser Gln Asn Gly Ser Cys Arg Gln
1010 1015 1020

Val Gln Tyr Pro Leu Thr Asp Met Ser Pro Ile Leu Thr Ser Gly
1025 1030 1035

Asp Ser Asp Ile Ser Ser Pro Leu Leu Gln Asn Thr Val His Ile
1040 1045 1050

Asp Leu Ser Ala Leu Asn Pro Glu Leu Val Gln Ala Val Gln His
1055 1060 1065



1085 1090 1095 



Asp Asn Asp Gly Lys Lys Ile His Cys Ala Val Lys Ser Leu Asn
1100 1105 1110

Arg Ile Thr Asp Ile Gly Glu Val Ser Gln Phe Leu Thr Glu Gly
1115 1120 1125

Ile Ile Met Lys Asp Phe Ser His Pro Asn Val Leu Ser Leu Leu
1130 1135 1140

Gly Ile Cys Leu Arg Ser Glu Gly Ser Pro Leu Val Val Leu Pro
1145 1150 1155

Tyr Met Lys His Gly Asp Leu Arg Asn Phe Ile Arg Asn Glu Thr
1160 1165 1170

His Asn Pro Thr Val Lys Asp Leu Ile Gly Phe Gly Leu Gln Val
1175 1180 1185

Ala Lys Gly Met Lys Tyr Leu Ala Ser Lys Lys Phe Val His Arg
1190 1195 1200

Asp Leu Ala Ala Arg Asn Cys Met Leu Asp Glu Lys Phe Thr Val
1205 1210 1215

Lys Val Ala Asp Phe Gly Leu Ala Arg Asp Met Tyr Asp Lys Glu
1220 1225 1230

Tyr Tyr Ser Val His Asn Lys Thr Gly Ala Lys Leu Pro Val Lys
1235 1240 1245

Trp Met Ala Leu Glu Ser Leu Gln Thr Gln Lys Phe Thr Thr Lys
1250 1255 1260

Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu Leu Met Thr
1265 1270 1275

Pro Asp Pro Leu Tyr Glu Val Met Leu Lys Cys Trp His Pro Lys
1310 1315 1320

Ala Glu Met Arg Pro Ser Phe Ser Glu Leu Val Ser Arg Ile Ser
1325 1330 1335

Ala Ile Phe Ser Thr Phe Ile Gly Glu His Tyr Val His Val Asn
1340 1345 1350



Ala Thr Tyr Val Asn Val Lys Cys Val Ala Pro Tyr Pro Ser Leu 
1355 1360 1365 



Leu Ser Ser Glu Asp Asn Ala Asp Asp Glu Val Asp Thr Arg Pro 
1370 1375 1380 



Ala Ser Phe Trp Glu Thr Ser 
1385 1390 



<210> 13 

<211> 2558 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> gene 

<222> (1) . . (2558) 

<223> Chondroitin sulfate proteoglycan BEHAB/brevican mRNA, GPI isoform, complete cds

<220>

<221> CDS

<222> (48) . . (2063)



<301> Gary,S.C., Zerillo,C.A., Chiang,V.L., Gaw,J.U., Gray,G. and Hockfield,S.

<302> cDNA cloning, chromosomal localization, and expression analysis of human BEHAB/brevican, a brain specific proteoglycan regulated during cortical development and in glioma

<303> Gene

<304> 256

<305> 1



<400> 13 

tgtggcactg cctgcgtacc caaccccagc cctgggtagc ctgcagc atg gcc cag 56
Met Ala Gln
1

ctg ttc ctg ccc ctg ctg gca gcc ctg gtc ctg gcc cag gct cct gca 104
Leu Phe Leu Pro Leu Leu Ala Ala Leu Val Leu Ala Gln Ala Pro Ala
5 10 15

gct tta gca gat gtt ctg gaa gga gac agc tca gag gac cgc gct ttt 152
Ala Leu Ala Asp Val Leu Glu Gly Asp Ser Ser Glu Asp Arg Ala Phe
20 25 30 35

cgc gtg cgc atc gcg ggc gac gcg cca ctg cag ggc gtg ctc ggc ggc 200
Arg Val Arg Ile Ala Gly Asp Ala Pro Leu Gln Gly Val Leu Gly Gly
40 45 50

gcc ctc acc atc cct tgc cac gtc cac tac ctg cgg cca ccg ccg agc 248
Ala Leu Thr Ile Pro Cys His Val His Tyr Leu Arg Pro Pro Pro Ser
55 60 65

cgc cgg gct gtg ctg ggc tct ccg cgg gtc aag tgg act ttc ctg tcc 296
Arg Arg Ala Val Leu Gly Ser Pro Arg Val Lys Trp Thr Phe Leu Ser
70 75 80

cgg ggc cgg gag gca gag gtg ctg gtg gcg cgg gga gtg cgc gtc aag 344
Arg Gly Arg Glu Ala Glu Val Leu Val Ala Arg Gly Val Arg Val Lys
85 90 95

gtg aac gag gcc tac cgg ttc cgc gtg gca ctg cct gcg tac cca gcg 392
Val Asn Glu Ala Tyr Arg Phe Arg Val Ala Leu Pro Ala Tyr Pro Ala
100 105 110 115

tcg ctc acc gac gtc tcc ctg gcg ctg agc gag ctg cgc ccc aac gac 440
Ser Leu Thr Asp Val Ser Leu Ala Leu Ser Glu Leu Arg Pro Asn Asp
120 125 130

tca ggt atc tat cgc tgt gag gtc cag cac ggc atc gat gac agc agc 488
Ser Gly Ile Tyr Arg Cys Glu Val Gln His Gly Ile Asp Asp Ser Ser
135 140 145

gac gct gtg gag gtc aag gtc aaa ggg gtc gtc ttt ctc tac cga gag 536
Asp Ala Val Glu Val Lys Val Lys Gly Val Val Phe Leu Tyr Arg Glu
150 155 160

ggc tct gcc cgc tat gct ttc tcc ttt tct ggg gcc cag gag gcc tgt 584
Gly Ser Ala Arg Tyr Ala Phe Ser Phe Ser Gly Ala Gln Glu Ala Cys
165 170 175

gcc cgc att gga gcc cac atc gcc acc ccg gag cag ctc tat gcc gcc 632
Ala Arg Ile Gly Ala His Ile Ala Thr Pro Glu Gln Leu Tyr Ala Ala



acc gtg agg tat ccc atc cag acc cca cga gag gcc tgt tac gga gac 728
Thr Val Arg Tyr Pro Ile Gln Thr Pro Arg Glu Ala Cys Tyr Gly Asp
215 220 225

atg gat ggc ttc ccc ggg gtc cgg aac tat ggt gtg gtg gac ccg gat 776
Met Asp Gly Phe Pro Gly Val Arg Asn Tyr Gly Val Val Asp Pro Asp
230 235 240

gac ctc tat gat gtg tac tgt tat gct gaa gac cta aat gga gaa ttg 824
Asp Leu Tyr Asp Val Tyr Cys Tyr Ala Glu Asp Leu Asn Gly Glu Leu
245 250 255

ttc ctg ggt gac cct cca gag aag ctg aca ttg gag gaa gca cgg gcg 872
Phe Leu Gly Asp Pro Pro Glu Lys Leu Thr Leu Glu Glu Ala Arg Ala
260 265 270 275

tac tgc cag gag cgg ggt gca gag att gcc acc acg ggc caa ctg tat 920
Tyr Cys Gln Glu Arg Gly Ala Glu Ile Ala Thr Thr Gly Gln Leu Tyr
280 285 290

gca gcc tgg gat ggt ggc ctg gac cac tgc agc cca ggg tgg cta gct 968
Ala Ala Trp Asp Gly Gly Leu Asp His Cys Ser Pro Gly Trp Leu Ala
295 300 305

gat ggc agt gtg cgc tac ccc atc gtc aca ccc agc cag cgc tgt ggt 1016
Asp Gly Ser Val Arg Tyr Pro Ile Val Thr Pro Ser Gln Arg Cys Gly
310 315 320

ggg ggc ttg cct ggt gtc aag act ctc ttc ctc ttc ccc aac cag act 1064
Gly Gly Leu Pro Gly Val Lys Thr Leu Phe Leu Phe Pro Asn Gln Thr
325 330 335

ggc ttc ccc aat aag cac agc cgc ttc aac gtc tac tgc ttc cga gac 1112
Gly Phe Pro Asn Lys His Ser Arg Phe Asn Val Tyr Cys Phe Arg Asp
340 345 350 355

tcg gcc cag cct tct gcc atc cct gag gcc tcc aac cca gcc tcc aac 1160
Ser Ala Gln Pro Ser Ala Ile Pro Glu Ala Ser Asn Pro Ala Ser Asn
360 365 370

cca gcc tct gat gga cta gag gct atc gtc aca atg aca gag acc ctg 1208
Pro Ala Ser Asp Gly Leu Glu Ala Ile Val Thr Val Thr Glu Thr Leu
375 380 385

gac gaa ctg cag ctg cct cag gaa gcc aca gag agt gaa tcc cgt ggg 1256
Glu Glu Leu Gln Leu Pro Gln Glu Ala Thr Glu Ser Glu Ser Arg Gly
390 395 400

gcc atc tac tcc atc ccc atc atg gag gac gga gga ggt gga agc tcc 1304
Ala Ile Tyr Ser Ile Pro Ile Met Glu Asp Gly Gly Gly Gly Ser Ser
405 410 415



Thr Gln Ser Met Val Pro Pro Thr Gly Phe Ser Glu Glu Glu Gly Lys
440 445 450

gca ttg gag gaa gaa gag aaa tat gaa gat gaa gaa gag aaa gag gag 1448
Ala Leu Glu Glu Glu Glu Lys Tyr Glu Asp Glu Glu Glu Lys Glu Glu
455 460 465

gaa gaa gaa gag gag gag gtg gag gat gag gct ctg tgg gca tgg ccc 1496
Glu Glu Glu Glu Glu Glu Val Glu Asp Glu Ala Leu Trp Ala Trp Pro
470 475 480

agc gag ctc agc agc ccg ggc cct gag gcc tct ctc ccc act gag cca 1544
Ser Glu Leu Ser Ser Pro Gly Pro Glu Ala Ser Leu Pro Thr Glu Pro
485 490 495

gca gcc cag gag gag tca ctc tcc cag gcg cca gca agg gca gtc ctg 1592
Ala Ala Gln Glu Glu Ser Leu Ser Gln Ala Pro Ala Arg Ala Val Leu
500 505 510 515

cag cct ggt gca tca cca ctt cct gat gga gag tca gaa gct tcc agg 1640
Gln Pro Gly Ala Ser Pro Leu Pro Asp Gly Glu Ser Glu Ala Ser Arg
520 525 530

cct cca agg gtc cat gga cca cct act gag act ctg ccc act ccc agg 1688
Pro Pro Arg Val His Gly Pro Pro Thr Glu Thr Leu Pro Thr Pro Arg
535 540 545

gag agg aac cta gca tcc cca tca cct tcc act ctg gtt gag gca aga 1736
Glu Arg Asn Leu Ala Ser Pro Ser Pro Ser Thr Leu Val Glu Ala Arg
550 555 560

gag gtg ggg gag gca act ggt ggt cct gag cta tct ggg gtc cct cga 1784
Glu Val Gly Glu Ala Thr Gly Gly Pro Glu Leu Ser Gly Val Pro Arg
565 570 575

gga gag agc gag gag aca gga agc tcc gag ggt gcc cct tcc ctg ctt 1832
Gly Glu Ser Glu Glu Thr Gly Ser Ser Glu Gly Ala Pro Ser Leu Leu
580 585 590 595

:ca gcc aca cgg gcc cct gag ggt ac: agg gag ctg gag gcc :cc t :i 1880
Pro Ala Thr Arg Ala Pro Glu Gly Thr Arg Glu Leu Glu Ala Pro Ser
600 605 610

gaa gat aat tct gga aga act gcc cca gca ggg acc tca gtg cag gcc 1928
Glu Asp Asn Ser Gly Arg Thr Ala Pro Ala Gly Thr Ser Val Gln Ala
615 620 625

cag cca gtg ctg ccc act gac agc gc:: agc cga ggt gga gtg gcc gtg 1976
Gln Pro Val Leu Pro Thr Asp Ser Ala Ser Arg Gly Gly Val Ala Val
630 635 640

660 665 670

tttaacccac catcatccca aactctcctg tcctttgcct tcattctctt acccacctct 2133

acctatgggt ct:caatctc ggatatccac cttgtgggta tctcagctct ccgcgtcttt 2193 

accctgtgat cccagccccg ccactgacca tctgtgaccc ttccctgcca ttgggccctc 2253 

cacctgtggc tcacatctcg ccagccccac agagcatcct caggcctctc caagggtcct 2313 

catcacctat tgcagccttc agggctcggc ctattttcca ctactccctt catccgcctg 2373 

tgtgccgtcc cctttagctg cctcctattg atctcaggga agcctgggag tcccttctca 2433 

cccctcaacc tccggagtcc aggagaaccc gtacccccac agagccttaa gcaactactt 2493 

ctgtgaagta ttttttgact gtttcatgga aaacaagcct tggaaataaa tctctattaa 2553 

accgc 2558 



<210> 14 

<211> 671 

<212> PRT

<213> Homo sapiens 

<220> 

<221> gene 

<222> (1) . . (671) 

<223> Chondroitin sulfate proteoglycan BEHAB/brevican



<400> 14 

Met Ala Gln Leu Phe Leu Pro Leu Leu Ala Ala Leu Val Leu Ala Gln
1 5 10 15

Ala Pro Ala Ala Leu Ala Asp Val Leu Glu Gly Asp Ser Ser Glu Asp
20 25 30

35 40 45

Leu Gly Gly Ala Leu Thr Ile Pro Cys His Val His Tyr Leu Arg Pro
50 55 60

Pro Pro Ser Arg Arg Ala Val Leu Gly Ser Pro Arg Val Lys Trp Thr



Arg Val Lys Val Asn Glu Ala Tyr Arg Phe Arg Val Ala Leu Pro Ala 
100 105 110 



Tyr Pro Ala Ser Leu Thr Asp Val Ser Leu Ala Leu Ser Glu Leu Arg 
115 120 125 



Pro Asn Asp Ser Gly Ile Tyr Arg Cys Glu Val Gln His Gly Ile Asp
130 135 140

Asp Ser Ser Asp Ala Val Glu Val Lys Val Lys Gly Val Val Phe Leu
145 150 155 160

Tyr Arg Glu Gly Ser Ala Arg Tyr Ala Phe Ser Phe Ser Gly Ala Gln
165 170 175

Glu Ala Cys Ala Arg Ile Gly Ala His Ile Ala Thr Pro Glu Gln Leu
180 185 190

Tyr Ala Ala Tyr Leu Gly Gly Tyr Glu Gln Cys Asp Ala Gly Trp Leu
195 200 205

Ser Asp Gln Thr Val Arg Tyr Pro Ile Gln Thr Pro Arg Glu Ala Cys
210 215 220

Tyr Gly Asp Met Asp Gly Phe Pro Gly Val Arg Asn Tyr Gly Val Val
225 230 235 240

Asp Pro Asp Asp Leu Tyr Asp Val Tyr Cys Tyr Ala Glu Asp Leu Asn
245 250 255



Gly Glu Leu Phe Leu Gly Asp Pro Pro Glu Lys Leu Thr Leu Glu Glu

Ala Arg Ala Tyr Cys Gln Glu Arg Gly Ala Glu Ile Ala Thr Thr Gly
275 280 285

Gln Leu Tyr Ala Ala Trp Asp Gly Gly Leu Asp His Cys Ser Pro Gly
290 295 300



Arg Cys Gly Gly Gly Leu Pro Gly Val Lys Thr Leu Phe Leu Phe Pro 
325 330 335 



Asn Gln Thr Gly Phe Pro Asn Lys His Ser Arg Phe Asn Val Tyr Cys
340 345 350

Phe Arg Asp Ser Ala Gln Pro Ser Ala Ile Pro Glu Ala Ser Asn Pro
355 360 365

Ala Ser Asn Pro Ala Ser Asp Gly Leu Glu Ala Ile Val Thr Val Thr
370 375 380

Glu Thr Leu Glu Glu Leu Gln Leu Pro Gln Glu Ala Thr Glu Ser Glu
385 390 395 400

Ser Arg Gly Ala Ile Tyr Ser Ile Pro Ile Met Glu Asp Gly Gly Gly
405 410 415

Gly Ser Ser Thr Pro Glu Asp Pro Ala Glu Ala Pro Arg Thr Leu Leu
420 425 430

Glu Phe Glu Thr Gln Ser Met Val Pro Pro Thr Gly Phe Ser Glu Glu
435 440 445

Glu Gly Lys Ala Leu Glu Glu Glu Glu Lys Tyr Glu Asp Glu Glu Glu
450 455 460

Lys Glu Glu Glu Glu Glu Glu Glu Glu Val Glu Asp Glu Ala Leu Trp
465 470 475 480

Ala Trp Pro Ser Glu Leu Ser Ser Pro Gly Pro Glu Ala Ser Leu Pro
485 490 495

Thr Glu Pro Ala Ala Gln Glu Glu Ser Leu Ser Gln Ala Pro Ala Arg
500 505 510

Ala Val Leu Gln Pro Gly Ala Ser Pro Leu Pro Asp Gly Glu Ser Glu
515 520 525

545 550 555 560



Glu Ala Arg Glu Val Gly Glu Ala Thr Gly Gly Pro Glu Leu Ser Gly
565 570 575

Val Pro Arg Gly Glu Ser Glu Glu Thr Gly Ser Ser Glu Gly Ala Pro
580 585 590

Ser Leu Leu Pro Ala Thr Arg Ala Pro Glu Gly Thr Arg Glu Leu Glu
595 600 605

Ala Pro Ser Glu Asp Asn Ser Gly Arg Thr Ala Pro Ala Gly Thr Ser
610 615 620

Val Gln Ala Gln Pro Val Leu Pro Thr Asp Ser Ala Ser Arg Gly Gly
625 630 635 640

Val Ala Val Val Pro Ala Ser Gly Asn Ser Ala Gln Gly Ser Thr Ala
645 650 655

Leu Ser Ile Leu Leu Leu Phe Phe Pro Leu Gln Leu Trp Val Thr
660 665 670



<210> 15

<211> 2316

<212> DNA

<213> Homo sapiens

<220>

<221> gene

<222> (1) . . (2316)

<223> Human mRNA for CD44E (epithelial form)



<220>

<221> CDS

<222> (124) . . (1605)



<300>

<301> Stamenkovic,I., Aruffo,A., Amiot,M. and Seed,B.

<302> The hematopoietic and epithelial forms of CD44 are distinct polypeptides with different adhesion potentials for hyaluronate-bearing cells

<303> EMBO J.

<400> 15

agrggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct ccgccggccc 60

ctgccccgcg c^cagggatc ctccagctcc tttcgcccgc gccctccgtt cgctccggac 120 

acc atg gac aag ttt tgg tgg cac gca gcc tgg gga ctc tgc ctc gtg 168
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val
1 5 10 15

ccg ctg agc ctg gcg cag atc gat ttg aat ata acc tgc cgc ttt gca 216
Pro Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala
20 25 30

ggt gta ttc cac gtg gag aaa aat ggt cgc tac agc atc tct cgg acg 264
Gly Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr
35 40 45

gag gcc gct gac ctc tgc aag gct ttc aat agc acc ttg ccc aca atg 312
Glu Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met
50 55 60

gcc cag atg gag aaa gct ctg agc atc gga ttt gag acc tgc agg tat 360
Ala Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr
65 70 75

ggg ttc ata gaa ggg cat gtg gtg att ccc cgg atc cac ccc aac tcc 408
Gly Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser
80 85 90 95

atc tgt gca gca aac aac aca ggg gtg tac atc ctc aca tac aac acc 456
Ile Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Tyr Asn Thr
100 105 110

tcc cag tat gac aca tat tgc ttc aat gct tca gct cca cct gaa gaa 504
Ser Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu
115 120 125

gat tgt aca tca gtc aca gac ctg ccc aat gcc ttt gat gga cca att 552
Asp Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro Ile
130 135 140

acc ata act att gtt aac cgt gat ggc acc cgc tat gtc cag aaa gga 600
Thr Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly
145 150 155

gaa tac aga acg aat cct gaa gac atc tac ccc agc aac cct act gat 648
Glu Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp
160 165 170 175



195 200 205 

gac agt ccc tgg atc acc gac agc aca gac aga atc cct cgt acc aat 792
Asp Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Arg Thr Asn
210 215 220

atg gac tcc agt cat agt aca acg ctt cag cct act gca aat cca aac 840
Met Asp Ser Ser His Ser Thr Thr Leu Gln Pro Thr Ala Asn Pro Asn
225 230 235

aca ggt ttg gtg gaa gat ttg gac agg aca gga cct ctt tca atg aca 888
Thr Gly Leu Val Glu Asp Leu Asp Arg Thr Gly Pro Leu Ser Met Thr
240 245 250 255

acg cag cag agt aat tct cag agc ttc tct aca tca cat gaa ggc ttg 936
Thr Gln Gln Ser Asn Ser Gln Ser Phe Ser Thr Ser His Glu Gly Leu
260 265 270

gaa gaa gat aaa gac cat cca aca act tct act ctg aca tca agc aat 984
Glu Glu Asp Lys Asp His Pro Thr Thr Ser Thr Leu Thr Ser Ser Asn
275 280 285

agg aat gat gtc aca ggt gga aga aga gac cca aat cat tct gaa ggc 1032
Arg Asn Asp Val Thr Gly Gly Arg Arg Asp Pro Asn His Ser Glu Gly
290 295 300

tca act cat tta ctg gaa ggt tat acc tct cat tac cca cac acg aag 1080
Ser Thr His Leu Leu Glu Gly Tyr Thr Ser His Tyr Pro His Thr Lys
305 310 315

gaa agc agg acc ttc atc cca gtg acc tca gct aag act ggg tcc ttt 1128
Glu Ser Arg Thr Phe Ile Pro Val Thr Ser Ala Lys Thr Gly Ser Phe
320 325 330 335

gga gtt act gca gtt act gtt gga gat tcc aac tct aat gtc aat cgt 1176
Gly Val Thr Ala Val Thr Val Gly Asp Ser Asn Ser Asn Val Asn Arg
340 345 350

tcc tta tca gga gac caa gac aca ttc cac ccc agt ggg ggg tcc cat 1224
Ser Leu Ser Gly Asp Gln Asp Thr Phe His Pro Ser Gly Gly Ser His
355 360 365

acc act cat gga tcr gaa tca gat gga cac tca cat ggg agt caa gaa 1272
Thr Thr His Gly Ser Glu Ser Asp Gly His Ser His Gly Ser Gln Glu
370 375 380

ggt gga gca aac aca acc tct ggt cct ata agg aca ccc caa att cca 1320
Gly Gly Ala Asn Thr Thr Ser Gly Pro Ile Arg Thr Pro Gln Ile Pro
385 390 395

naa ^aa ^+- a „- ♦ ^ ^+ ft a aca t rtr tta acc tra act tta att ctt 1368 



aag cta gtg atc aac agt ggc aat gga gct gtg gag gac aga aag cca 1464
Lys Leu Val Ile Asn Ser Gly Asn Gly Ala Val Glu Asp Arg Lys Pro
435 440 445

agt gga ctc aac gga gag gcc agc aag tct cag gaa atg gtg cat ttg 1512
Ser Gly Leu Asn Gly Glu Ala Ser Lys Ser Gln Glu Met Val His Leu
450 455 460

gtg aac aag gag tcg tca gaa act cca gac cag ttt atg aca gct gat 1560
Val Asn Lys Glu Ser Ser Glu Thr Pro Asp Gln Phe Met Thr Ala Asp
465 470 475

gag aca agg aac ctg cag aat gtg gac atg aag att ggg gtg taa 1605
Glu Thr Arg Asn Leu Gln Asn Val Asp Met Lys Ile Gly Val
480 485 490

cacctacacc attatcttgg aaagaaacaa cgttggaaac ataaccatta caggggagct 1665

gggacactta acagatgcaa tgtgctactg attgtttcat ttcgaatcta taatagcata 1725

aaattttcta ctctttttgt tttttgtgtt ttgttcttta aagtcaggtc caatttgtaa 1785

aaacagcatt gctttctgaa attagggccc aattaataat cagcaagaat tttgatcgtt 1845

tcagttcccc acttggaggc ctttcatccc tcgggtgtgc tatggatggc ttctaacaaa 1905

aacctaccac atagttattc ctgatcgcca accttgcccc ccaccagcta aggacatttc 1965

cagggttaat agggcctggt cctgggagga aatttgaatg ggtcattttg cccttccatt 2025

agcctaatcc ctgggcattg ctttccactg aggttggggg ttggggtgta ctagttacac 2085

atcttcaaca gaccccctct agaaattttt cagatgcttc tgggagacac ccaaagggta 2145

actctattta tctgtagtaa actatttatc tgtgtttttg aaatattaaa ccctggatca 2205

gtccttttat tcagtataat tttttaaagt tactttgtca gaggcacaaa aagggtttaa 2265

actgattcat aataaatatc tgtaccttct tcgaaaaaaa aaaaaaaaaa a 2316



<210> 16

<211> 742

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL

<222> (1) . . (20)

<223> BY SIMILARITY




<220>

<221> CHAIN

<222> (21) . . (742)

<223> CD44 ANTIGEN

<220>

<221> DOMAIN

<222> (21) . . (649)

<223> EXTRACELLULAR (POTENTIAL)

<220>

<221> TRANSMEM

<222> (650) . . (670)

<223> POTENTIAL

<220>

<221> DOMAIN

<222> (671) . . (742)

<223> CYTOPLASMIC (POTENTIAL)

<220>

<221> DOMAIN

<222> (48) . . (119)

<223> LINK

<220>

<221> MOD_RES

<222> (21) . . (21)

<223> PYRROLIDONE CARBOXYLIC ACID (PROBABLE)
<220> 

<221> DISULFID 

<222> (53) . . (118) 

<223> BY SIMILARITY 

<220>

<221> DISULFID 

<222> (77) . . (97) 

<223> BY SIMILARITY 



<220>

<221> CARBOHYD

<222> (25) . . (25)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (57) . . (57)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (110) . . (110)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> CARBOHYD

<222> (120) . . (120)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (350) . . (350)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (548) . . (548)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (599) . . (599)

<223> N-LINKED (GLCNAC...) (POTENTIAL)

<220>

<221> CARBOHYD

<222> (636) . . (636)

<223> N-LINKED (GLCNAC...) (POTENTIAL)



<220>

<221> VARSPLIC

<222> (23) . . (29)

<223> DLNITCR -> GVGRRKS (IN ISOFORM CD44SP)

<220>

<221> VARSPLIC

<222> (30) . . (742)

<223> MISSING (IN ISOFORM CD44SP)



2 2 U > 



VARSPLIC 



1 G ^ ) ;' 1 



> A 'IN ISOFORM WITH ALTERNATIVE SPLICE DONOR ''ACCEPTOR ON EXON 



<220>

<221> VARSPLIC

<222> (193) . . (223)

<223> MISSING (IN ISOFORM WITH ALTERNATIVE SPLICE DONOR/ACCEPTOR ON EXON 5)






<220>

<221> VARSPLIC

<222> (224) . . (266)

<223> MISSING (IN ISOFORM WITHOUT EXON 6)




<220>

<221> VARSPLIC

<222> (223) . . (223)

<223> T -> N (IN ISOFORMS WITHOUT EXONS 6 TO 11)

<220>

<221> VARSPLIC

<222> (223) . . (223)

<223> T -> R (IN ISOFORMS WITHOUT EXONS 6 TO 14)



<220>

<221> VARSPLIC

<222> (224) . . (604)

<223> MISSING (IN ISOFORMS WITHOUT EXONS 6 TO 14)

<220>

<221> VARSPLIC

<222> (266) . . (273)

<223> MISSING (IN ISOFORM WITH ALTERNATIVE SPLICE DONOR/ACCEPTOR ON EXON 7)




<220>

<221> VARSPLIC

<222> (385) . . (385)

<223> I -> T (IN ISOFORMS WITHOUT EXON 10)

<220>

<221> VARSPLIC

<222> (386) . . (428)

<223> MISSING (IN ISOFORMS WITHOUT EXON 10)



V/iRSPLIC 
i 5l 6 ) . . ( 
C - ■ K ■ 



( 5 0 h 



I N 



i. 1 



<220>

<221> VARSPLIC

<222> (507) . . (535)

<223> MISSING (IN ISOFORMS WITHOUT EXON 13)



<220>

<221> VARSPLIC

<223> MISSING (IN ISOFORMS WITHOUT EXON 14)



<220>

<221> VARSPLIC

<222> (675) . . (675)

<223> R -> S (IN ISOFORMS WITHOUT EXON 19)



<220>

<221> VARSPLIC

<222> (676) . . (742)

<223> MISSING (IN ISOFORMS WITHOUT EXON 19)



<220>

<221> VARIANT

<222> (46) . . (46)

<223> R -> P (IN IN(A) ANTIGEN)

<400> 16

Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15

Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30

Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45

Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met Ala
50 55 60

Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly
65 70 75 80

Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser Ile
85 90 95

Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110

Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125



145 150 155 160 



Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175

Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190

Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205

Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr Thr Leu
210 215 220

Met Ser Thr Ser Ala Thr Ala Thr Glu Thr Ala Thr Lys Arg Gln Glu
225 230 235 240

Thr Trp Asp Trp Phe Ser Trp Leu Phe Leu Pro Ser Glu Ser Lys Asn
245 250 255

His Leu His Thr Thr Thr Gln Met Ala Gly Thr Ser Ser Asn Thr Ile
260 265 270

Ser Ala Gly Trp Glu Pro Asn Glu Glu Asn Glu Asp Glu Arg Asp Arg
275 280 285

His Leu Ser Phe Ser Gly Ser Gly Ile Asp Asp Asp Glu Asp Phe Ile
290 295 300

Ser Ser Thr Ile Ser Thr Thr Pro Arg Ala Phe Asp His Thr Lys Gln

Asn Gln Asp Trp Thr Gln Trp Asn Pro Ser His Ser Asn Pro Glu Val
325 330 335

Leu Leu Gln Thr Thr Thr Arg Met Thr Asp Val Asp Arg Asn Gly Thr
340 345 350

Ile Gln Ala Thr Pro Ser Ser Thr Thr Glu Glu Thr Ala Thr Gln Lys
385 390 395 400



Glu Gln Trp Phe Gly Asn Arg Trp His Glu Gly Tyr Arg Gln Thr Pro
405 410 415

Arg Glu Asp Ser His Ser Thr Thr Gly Thr Ala Ala Ala Ser Ala His
420 425 430

Thr Ser His Pro Met Gln Gly Arg Thr Thr Pro Ser Pro Glu Asp Ser
435 440 445

Ser Trp Thr Asp Phe Phe Asn Pro Ile Ser His Pro Met Gly Arg Gly
450 455 460

His Gln Ala Gly Arg Arg Met Asp Met Asp Ser Ser His Ser Thr Thr
465 470 475 480

Leu Gln Pro Thr Ala Asn Pro Asn Thr Gly Leu Val Glu Asp Leu Asp
485 490 495

Arg Thr Gly Pro Leu Ser Met Thr Thr Gln Gln Ser Asn Ser Gln Ser
500 505 510

Phe Ser Thr Ser His Glu Gly Leu Glu Glu Asp Lys Asp His Pro Thr
515 520 525

Thr Ser Thr Leu Thr Ser Ser Asn Arg Asn Asp Val Thr Gly Gly Arg
530 535 540

Arg Asp Pro Asn His Ser Glu Gly Ser Thr Thr Leu Leu Glu Gly Tyr
545 550 555 560

Thr Ser His Tyr Pro His Thr Lys Glu Ser Arg Thr Phe Ile Pro Val
565 570 575

Thr Ser Ala Lys Thr Gly Ser Phe Gly Val Thr Ala Val Thr Val Gly

Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly Ser Glu Ser Asp
610 615 620

Gly His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr Thr Ser Gly
630 635 640



Pro Ile Arg Thr Pro Gln Ile Pro Glu Trp Leu Ile Ile Leu Ala Ser
645 650 655

Leu Leu Ala Leu Ala Leu Ile Leu Ala Val Cys Ile Ala Val Asn Ser
660 665 670

Arg Arg Arg Cys Gly Gln Lys Lys Lys Leu Val Ile Asn Ser Gly Asn
675 680 685

Gly Ala Val Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu Ala Ser
690 695 700

Lys Ser Gln Glu Met Val His Leu Val Asn Lys Glu Ser Ser Glu Thr
705 710 715 720

Pro Asp Gln Phe Met Thr Ala Asp Glu Thr Arg Asn Leu Gln Asn Val
725 730 735

Asp Met Lys Ile Gly Val
740 



<210> 17 
<211> 930

<212> DNA

<213> Homo sapiens

<220>

<221> Gene

<222> (1) . . (930)

<223> Tetraspan TM4SF (TSPAN-3)

<220>

<221> CDS

<222> (1) . . (762)

<223> mRNA




<304> 1399 

<305> 1

<306> 101-104

<307> 1998

<309>

<313> (1) . . (930)

<400> 17

atg ggc cag tgc ggc atc acc tcc tcc aag acc gtg ctg gtc ttt ctc 48
Met Gly Gln Cys Gly Ile Thr Ser Ser Lys Thr Val Leu Val Phe Leu
1 5 10 15

aac ctc atc ttc tgg ggg gca gct ggc att tta tgc tat gtg gga gcc 96
Asn Leu Ile Phe Trp Gly Ala Ala Gly Ile Leu Cys Tyr Val Gly Ala
20 25 30

tat gtc ttc atc act tat gat gac tat gac cac ttc ttt gaa gat gtg 144
Tyr Val Phe Ile Thr Tyr Asp Asp Tyr Asp His Phe Phe Glu Asp Val
35 40 45

tac acg ctc atc cct gct gta gtg atc ata gct gta gga gcc ctg ctt 192
Tyr Thr Leu Ile Pro Ala Val Val Ile Ile Ala Val Gly Ala Leu Leu
50 55 60

ttc atc att ggg cta att ggc tgc tgt gcc aca atc cgg gaa agt cgc 240
Phe Ile Ile Gly Leu Ile Gly Cys Cys Ala Thr Ile Arg Glu Ser Arg
65 70 75 80

tgt gga ctt gcc acg ttt gtc atc atc ctg ctc ttg gtt ttt gtc aca 288
Cys Gly Leu Ala Thr Phe Val Ile Ile Leu Leu Leu Val Phe Val Thr
85 90 95

gaa gtt gtt gta gtg gtt ttg gga tat gtt tac aga gca aag gtg gaa 336
Glu Val Val Val Val Val Leu Gly Tyr Val Tyr Arg Ala Lys Val Glu
100 105 110

aat gag gtt gat cgc agc att cag aaa gtg tat aag acc tac aat gga 384
Asn Glu Val Asp Arg Ser Ile Gln Lys Val Tyr Lys Thr Tyr Asn Gly
115 120 125

acc aac cct gat gct gct agc cgg gct att gat tat gta cag aga cag 432
Thr Asn Pro Asp Ala Ala Ser Arg Ala Ile Asp Tyr Val Gln Arg Gln
130 135 140

ctg cat tgt tgt gga att cac aac tac tca gac tgg gaa aat aca gat 480
Leu His Cys Cys Gly Ile His Asn Tyr Ser Asp Trp Glu Asn Thr Asp
145 150 155 160

tgg ttc aaa gaa acc aaa aac cag agt gtc cct ctt agc tgc tgc aga 528
Trp Phe Lys Glu Thr Lys Asn Gln Ser Val Pro Leu Ser Cys Cys Arg



tat gct gag ggg tgt gag gct cta gtt gtg aag aag cta caa gaa atc 624
Tyr Ala Glu Gly Cys Glu Ala Leu Val Val Lys Lys Leu Gln Glu Ile
195 200 205

atg atg cat gtg atc tgg gcc gca ctg gca ttt gca gct att cag ctg 672
Met Met His Val Ile Trp Ala Ala Leu Ala Phe Ala Ala Ile Gln Leu
210 215 220

ctg ggc atg ctg tgt gct tgc atc gtg ttg tgc aga agg agt aga gat 720
Leu Gly Met Leu Cys Ala Cys Ile Val Leu Cys Arg Arg Ser Arg Asp
225 230 235 240

cct gct tac gag ctc ctc atc act ggc gga acc tat gca tag 762
Pro Ala Tyr Glu Leu Leu Ile Thr Gly Gly Thr Tyr Ala
245 250

ttgacaactc ttgcctgagc tttttggtct tgttctgatt tggaaggtga attgagcagg 822

tctgctgctg ttggcctctg gagttcattt agttaaagca catgtacact ggtgttggac 882

agagcagctt ggcttttcat gtgcccaact acttactact actgcgat 930



<210> 18 

<211> 253 

<212> PRT 

<213> Homo sapiens 

<220>

<221> DOMAIN 

<222> (1) . . (11) 

<223> Cytoplasmic (Potential) 

<220> 

<221> Gene 

<222> ( 1 ) . . (253) 

<223> Tetraspanin-3

<220>

<221> TRANSMEM

<222> (12) . . (32)

<223> Potential

<220>

<221> DOMAIN

<222> (33) . . (50)

<223> Extracellular (Potential)

<220>

<221> TRANSMEM

<222> (51) . . (71)




<220> 

<221> TRANSMEM

<222> (86) . . (106)

<223> Potential

<220>

<221> DOMAIN

<222> (107) . . (212)

<223> Extracellular (potential)

<220>

<221> TRANSMEM

<222> (213) . . (233)

<223> Potential

<220>

<221> DOMAIN

<222> (234) . . (253)

<223> Cytoplasmic (potential)

<220>

<221> CARBOHYD

<222> (127) . . (127)

<223> N-linked (GLCNAC...) (potential)

<220>

<221> CARBOHYD

<222> (152) . . (152)

<223> N-linked (GLCNAC...) (potential)

<220>

<221> CARBOHYD

<222> (167) . . (167)

<223> N-linked (GLCNAC...) (potential)

<220>

<221> CARBOHYD

<222> (183) . . (183)

<223> N-linked (GLCNAC...) (potential)



Met Gly Gln Cys Gly Ile Thr Ser Ser Lys Thr Val Leu Val Phe Leu
1 5 10 15

Asn Leu Ile Phe Trp Gly Ala Ala Gly Ile Leu Cys Tyr Val Gly Ala



50 55 60 



Phe Ile Ile Gly Leu Ile Gly Cys Cys Ala Thr Ile Arg Glu Ser Arg
65 70 75 80

Cys Gly Leu Ala Thr Phe Val Ile Ile Leu Leu Leu Val Phe Val Thr
85 90 95

Glu Val Val Val Val Val Leu Gly Tyr Val Tyr Arg Ala Lys Val Glu
100 105 110

Asn Glu Val Asp Arg Ser Ile Gln Lys Val Tyr Lys Thr Tyr Asn Gly
115 120 125

Thr Asn Pro Asp Ala Ala Ser Arg Ala Ile Asp Tyr Val Gln Arg Gln
130 135 140

Leu His Cys Cys Gly Ile His Asn Tyr Ser Asp Trp Glu Asn Thr Asp
145 150 155 160

Trp Phe Lys Glu Thr Lys Asn Gln Ser Val Pro Leu Ser Cys Cys Arg
165 170 175

Glu Thr Ala Ser Asn Cys Asn Gly Ser Leu Ala His Pro Ser Asp Leu
180 185 190

Tyr Ala Glu Gly Cys Glu Ala Leu Val Val Lys Lys Leu Gln Glu Ile
195 200 205

Met Met His Val Ile Trp Ala Ala Leu Ala Phe Ala Ala Ile Gln Leu
210 215 220

Leu Gly Met Leu Cys Ala Cys Ile Val Leu Cys Arg Arg Ser Arg Asp
225 230 235 240

Pro Ala Tyr Glu Leu Leu Ile Thr Gly Gly Thr Tyr Ala
245 250



<221> Gene 

<222> (1) . . (1317) 

<223> Vasoactive Intestinal Peptide Receptor-2
<220>

<221> CDS 

<222> (1) . . (1317) 

<223> 

<300>

<301> Svoboda,M., Tastenoy,M., Van Rampelbergh,J., Goossens,J.F., De Neef,P., Waelbroeck,M. and Robberecht,P.

<302> Molecular cloning and functional characterization of a human VIP receptor from SUP-T1 lymphoblasts 205 (3), 1617-1624 (1994)

<303> Biochem. Biophys. Res. Commun.

<304> 205

<305> 3

<306> 1617-1624 

<307> 1994 
<309> 

<313> (1) . . (1317) 



<400> 19 

atg cgg acg ctg ctg cct ccc gcg ctg ctg acc tgc tgg ctg ctc gcc 48

Met Arg Thr Leu Leu Pro Pro Ala Leu Leu Thr Cys Trp Leu Leu Ala

1 5 10 15

ccc gtg aac agc att cac cca gaa tgc cga ttt cat ctg gaa ata cag 96

Pro Val Asn Ser Ile His Pro Glu Cys Arg Phe His Leu Glu Ile Gln

20 25 30

gag gaa gaa aca aaa tgt aca gag ctt ctg agg tct caa aca gaa aaa 144

Glu Glu Glu Thr Lys Cys Thr Glu Leu Leu Arg Ser Gln Thr Glu Lys

35 40 45

cac aaa gcc tgc agt ggc gtc tgg gac aac atc acg tgc tgg cgg cct 192

His Lys Ala Cys Ser Gly Val Trp Asp Asn Ile Thr Cys Trp Arg Pro

50 55 60

gcc aat gtg gga gag acc gtc acg gtg ccc tgc cca aaa gtc ttc agc 240

Ala Asn Val Gly Glu Thr Val Thr Val Pro Cys Pro Lys Val Phe Ser

aat ttt tac agc aaa gca gga aac ata agc aaa aac tgt acg agt gac 288

Asn Phe Tyr Ser Lys Ala Gly Asn Ile Ser Lys Asn Cys Thr Ser Asp

85 90 95

gga tgg tca gag acg ttc cca gat ttc gtc gat gcc tgt ggc tac agc 336

Gly Trp Ser Glu Thr Phe Pro Asp Phe Val Asp Ala Cys Gly Tyr Ser



Ile Tyr Thr Leu Gly Tyr Ser Val Ser Leu Met Ser Leu Ala Thr Gly

130 135 140

agc ata att ctg tgc ctc tt : agg aag ctg cac tgc acc agg aat tac 480

Ser Ile Ile Leu Cys Leu Phe Arg Lys Leu His Cys Thr Arg Asn Tyr

145 150 155 160

atc cac ctg aac ctg ttc ctg tcc ttc atc ctg aga gcc atc tca gtg 528

Ile His Leu Asn Leu Phe Leu Ser Phe Ile Leu Arg Ala Ile Ser Val
165 170 175

ctg gtc aag gac gac gtt ctc tac tcc agc tct ggc acg ttg cac tgc 576

Leu Val Lys Asp Asp Val Leu Tyr Ser Ser Ser Gly Thr Leu His Cys
180 185 190

cct gac cag cca tcc tcc tgg gtg ggc tgc aag ctg agc ctg gtc ttc 624

Pro Asp Gln Pro Ser Ser Trp Val Gly Cys Lys Leu Ser Leu Val Phe

195 200 205

ctg cag tac tgc atc atg gcc aac ttc ttc tgg ctg ctg gtg gag ggg 672

Leu Gln Tyr Cys Ile Met Ala Asn Phe Phe Trp Leu Leu Val Glu Gly

210 215 220

ctc tac ctc cac acc ctc ctg gtg gcc atg ctc ccc cct aga agg tgc 720

Leu Tyr Leu His Thr Leu Leu Val Ala Met Leu Pro Pro Arg Arg Cys

225 230 235 240

ttc ctg gcc tac ctc ctg atc gga tgg ggc ctc ccc acc gtc tgc atc 768

Phe Leu Ala Tyr Leu Leu Ile Gly Trp Gly Leu Pro Thr Val Cys Ile
245 250 255

ggt gca tgg act gcg gcc agg ctc tac tta gaa gac acc ggt tgc tgg 816

Gly Ala Trp Thr Ala Ala Arg Leu Tyr Leu Glu Asp Thr Gly Cys Trp
260 265 270

gat aca aac gac cac agt gtg ccc tgg tgg gtc ata cga ata ccg att 864

Asp Thr Asn Asp His Ser Val Pro Trp Trp Val Ile Arg Ile Pro Ile

275 280 285

tta att tcc atc atc gtc aat ttt gt :; ctt ttc att agt att ata cga 912

Leu Ile Ser Ile Ile Val Asn Phe Val Leu Phe Ile Ser Ile Ile Arg

290 295 300

att ttg ctg cag aag tta aca tcc cca gat gtc ggc ggc aac gac cag 960

Ile Leu Leu Gln Lys Leu Thr Ser Pro Asp Val Gly Gly Asn Asp Gln

305 310 315 320

tct cag tac aag agg ctg gcc aag tcc acg ctc ctg ctt atc ccg ctg 1008

Ser Gln Tyr Lys Arg Leu Ala Lys Ser Thr Leu Leu Leu Ile Pro Leu

325 330 335



355 360 365

ctg gtg gtg gcc gtc ctc tac tgt ttc ctg aac agt gag gtg cag tgc 1152

Leu Val Val Ala Val Leu Tyr Cys Phe Leu Asn Ser Glu Val Gln Cys

370 375 380

gag ctg aag cga aaa tgg cga agc cgg tgc ccg acc ccg tcc gcg agc 1200

Glu Leu Lys Arg Lys Trp Arg Ser Arg Cys Pro Thr Pro Ser Ala Ser

385 390 395 400

cgg gat tac agg gtc tgc ggt tcc tcc ttc tcc cac aac ggc tcg gag 1248

Arg Asp Tyr Arg Val Cys Gly Ser Ser Phe Ser His Asn Gly Ser Glu

405 410 415

ggc gcc ctg cag ttc cac cgc gcg tcc cga gcc cag tcc ttc ctg caa 1296

Gly Ala Leu Gln Phe His Arg Ala Ser Arg Ala Gln Ser Phe Leu Gln

420 425 430

acg gag acc tcg gtc atc tag 1317

Thr Glu Thr Ser Val Ile

435



<210> 20

<211> 438

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL

<222> (1) . . (23)

<223> Potential

<220>

<221> GENE

<222> (1) . . (438)

<223> Vasoactive Intestinal Peptide Receptor 2

<220>

<221> CHAIN

<222> (24) . . (438)

<223> Vasoactive Intestinal Polypeptide Receptor



<220>

<221> DOMAIN

<222> (24) . . (126)
1 (Potential) 



TRANSMEM 

■' 1 2 ^ . . f 1 ' 




<220>

<221> TRANSMEM

<222> (159) . . (178)

<223> 2 (Potential)

<220>

<221> DOMAIN

<222> (179) . . (203)

<223> Extracellular (Potential)

<220>

<221> TRANSMEM

<222> (204) . . (227)

<223> 3 (Potential)

<220>

<221> DOMAIN

<222> (228) . . (240)

<223> Cytoplasmic (Potential)

<220>

<221> TRANSMEM

<222> (241) . . (262)

<223> 4 (Potential)

<220>

<221> DOMAIN

<222> (263) . . (279)

<223> Extracellular (Potential)

<220>

<221> TRANSMEM

<222> (280) . . (303)

<223> 5 (Potential)

<220>

<221> DOMAIN

<222> (304) . . (328)

<223> Cytoplasmic (Potential)

<220>

<221> TRANSMEM

<222> (329) . . (348)

<223> 6 (Potential)

<220>

<221> DOMAIN

<222> (349) . . (360)

<223> Extracellular (Potential)



<220>

<221> DOMAIN

<222> (381) . . (438)

<223> Cytoplasmic (potential)

<220>

<221> CARBOHYD

<222> (58) . . (58)

<223> N-linked (GLCNAC...) (Potential)

<220>

<221> CARBOHYD

<222> (88) . . (88)

<223> N-linked (GLCNAC...) (Potential)

<220>

<221> CARBOHYD

<222> (92) . . (92)

<223> N-linked (GLCNAC...) (Potential)



<400> 20

Met Arg Thr Leu Leu Pro Pro Ala Leu Leu Thr Cys Trp Leu Leu Ala
1 5 10 15

Pro Val Asn Ser Ile His Pro Glu Cys Arg Phe His Leu Glu Ile Gln
20 25 30

Glu Glu Glu Thr Lys Cys Ala Glu Leu Leu Arg Ser Gln Thr Glu Lys
35 40 45

His Lys Ala Cys Ser Gly Val Trp Asp Asn Ile Thr Cys Trp Arg Pro
50 55 60

Ala Asn Val Gly Glu Thr Val Thr Val Pro Cys Pro Lys Val Phe Ser
65 70 75 80

Asn Phe Tyr Ser Lys Ala Gly Asn Ile Ser Lys Asn Cys Thr Ser Asp
85 90 95

Gly Trp Ser Glu Thr Phe Pro Asp Phe Val Asp Ala Cys Gly Tyr Ser
100 105 110

Asp Pro Glu Asr Glu Ser Lys Ile Thr Phe Tyr Ile Leu Val Lys Ala



Ser Ile Ile Leu Cys Leu Phe Arg Lys Leu His Cys Thr Arg Asn Tyr
145 150 155 160

Ile His Leu Asn Leu Phe Leu Ser Phe Ile Leu Arg Ala Ile Ser Val
165 170 175

Leu Val Lys Asp Asp Val Leu Tyr Ser Ser Ser Gly Thr Leu His Cys
180 185 190

Pro Asp Gln Pro Ser Ser Trp Val Gly Cys Lys Leu Ser Leu Val Phe
195 200 205

Leu Gln Tyr Cys Ile Met Ala Asn Phe Phe Trp Leu Leu Val Glu Gly
210 215 220

Leu Tyr Leu His Thr Leu Leu Val Ala Met Leu Pro Pro Arg Arg Cys
225 230 235 240

Phe Leu Ala Tyr Leu Leu Ile Gly Trp Gly Leu Pro Thr Val Cys Ile
245 250 255

Gly Ala Trp Thr Ala Ala Arg Leu Tyr Leu Glu Asp Thr Gly Cys Trp
260 265 270

Asp Thr Asn Asp His Ser Val Pro Trp Trp Val Ile Arg Ile Pro Ile
275 280 285

Leu Ile Ser Ile Ile Val Asn Phe Val Leu Phe Ile Ser Ile Ile Arg
290 295 300

Ile Leu Leu Gln Lys Leu Thr Ser Pro Asp Val Gly Gly Asn Asp Gln
305 310 315 320

Ser Gln Tyr Lys Arg Leu Ala Lys Ser Thr Leu Leu Leu Ile Pro Leu
325 330 335

Phe Gly Val His Tyr Met Val Phe Ala Val Phe Pro Ile Ser Ile Ser
340 345 350

Leu Val Val Ala Val Leu Tyr Cys Phe Leu Asn Ser Glu Val Gln Cys
370 375 380

Glu Leu Lys Arg Lys Trp Arg Ser Arg Cys Pro Thr Pro Ser Ala Ser
385 390 395 400

Arg Asp Tyr Arg Val Cys Gly Ser Ser Phe Ser Arg Asn Gly Ser Glu
405 410 415

Gly Ala Leu Gln Phe His Arg Gly Ser Arg Ala Gln Ser Phe Leu Gln
420 425 430

Thr Glu Thr Ser Val Ile
435



<210> 21 

<211> 889 

<212> DNA

<213> Homo sapiens

<220>

<221> Gene

<222> (1) . . (889)

<223> Pleiotrophin

<220>

<221> CDS

<222> (252) . . (758)

<223>

<300>

<301> Li, Y.S., Milner, P.G., Chauhan, A.K., Watson, M.A., Hoffman, R.M., Kodner, C.M., Milbrandt, J. and Deuel, T.F.

<302> Cloning and expression of a developmentally regulated protein that induces mitogenic and neurite outgrowth activity

<303> Science

<304> 250

<305> 4988

<306> 1690-1694

<307> 1990

<309>

<313> (1) . . (889)



tcagrggtag caacctcgcc ccttgcaaca aaggcagact gagcgccaga gaggacgttt 240 

ccaactcaaa a atg cag gct caa cag tac cag cag cag cgt cga aaa ttt 290
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe
1 5 10

gca gct gcc ttc ttg gca ttc att ttc ata rtg gca gct gtg gat act 338
Ala Ala Ala Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr
15 20 25

gct gaa gca ggg aag aaa gag aaa cca gaa aaa aaa gtg aag aag tct 386
Ala Glu Ala Gly Lys Lys Glu Lys Pro Glu Lys Lys Val Lys Lys Ser
30 35 40 45

gac tgt gga gaa tgg cag tgg agt gtg tgt gtg ccc acc agt gga gac 434

Asp Cys Gly Glu Trp Gln Trp Ser Val Cys Val Pro Thr Ser Gly Asp
50 55 60

tgt ggg ctg ggc aca cgg gag ggc act cgg act gga gct gag tgc aag 482

Cys Gly Leu Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys
65 70 75

caa acc atg aag acc cag aga tgt aag atc ccc tgc aac tgg aag aag 530
Gln Thr Met Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys
80 85 90

caa ttt ggc gcg gag tgc aaa tac cag ttc cag gcc tgg gga gaa tgt 578
Gln Phe Gly Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys
95 100 105

gac ctg aac aca gcc ctg aag acc aga act gga agt ctg aag cga gcc 626
Asp Leu Asn Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala
110 115 120 125

ctg cac aat gcc gaa tgc cag aag act gtc acc atc tcc aag ccc tgt 674
Leu His Asn Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys
130 135 140

ggc aaa ctg acc aag ccc aaa cct caa gca gaa tct aag aag aag aaa 722

Gly Lys Leu Thr Lys Pro Lys Pro Gln Ala Glu Ser Lys Lys Lys Lys

145 150 155

aag gaa ggc aag aaa cag gag aag atg ctg gat taa aagargtcac 768
Lys Glu Gly Lys Lys Gln Glu Lys Met Leu Asp
160 165

ctgtggaaca taaaaaggac atcagcaaac aggatcagtt aactattgca tttatatgta 828

ccgtaggctt tgtattcaaa aattatctat agctaagtac acaataagca aaaacaaaaa 888



'3 



88^ 




<220> 

<221> SIGNAL 

<222> (1) . . (32)

<223>

<220>

<221> GENE

<222> (1) . . (168)

<223> Pleiotrophin

<220>

<221> CHAIN

<222> (33) . . (168)

<223> Pleiotrophin

<220>

<221> DISULFID

<222> (47) . . (76)

<223> By similarity.

<220>

<221> DISULFID

<222> (55) . . (85)

<223> By similarity.

<220>

<221> DISULFID

<222> (62) . . (89)

<223> By similarity.

<220>

<221> DISULFID

<222> (99) . . (131)

<223> By similarity.

<220>

<221> DISULFID

<222> (109) . . (141)

<223> By similarity.



Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15

Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30

50 55 60

Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys Gln Thr Met
65 70 75 80

Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys Gln Phe Gly
85 90 95

Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn
100 105 110

Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn
115 120 125

Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu
130 135 140

Thr Lys Pro Lys Pro Gln Ala Glu Ser Lys Lys Lys Lys Lys Glu Gly
145 150 155 160

Lys Lys Gln Glu Lys Met Leu Asp
165



<210> 23 

<211> 3143 

<212> DNA 

<213> Homo sapiens 

<220>

<221> Gene

<222> (1) . . (3143)

<223> Osteopontin

<220>

<221> CDS

<222> (374) . . (427)

<220>

<221> CDS

<222> (537) . . (575)

<220>

<221> CDS

<222> (1232) . . (1273)

<223>

<220>

<221> CDS

<222> (1597) . . (1920)

<223>

<220>

<221> CDS

<222> (2305) . . (2709)

<300>

<301> Crosby, A.H., Edwards, S.J., Murray, J.C. and Dixon, M.J.

<302> Genomic organization of the human osteopontin gene: exclusion of the locus from a causative role in the pathogenesis of dentinogenesis imperfecta type II

<303> Genomics

<304> 27

<305> 1

<306> 155-160

<307> 1995

<309>

<313> (1) . . (3143)

<400> 23

ggggaagtgt gggagcaggt gggctgggca gtggcagaaa cctgatgaca caatctcgcc 60 

gcctccctgt gttggtggag gatgtctgca gcagcattta aattctggga gggcttggtt 120 

gtcagcagca gcaggaggag gcagagacag catcgtcggg accagactcg tctcaggcca 180 

gttgcagcct tctcagccaa acgccgacca aggtacagct tcagtttgct actgggttgt 240 

gcattcagct gaatttcatg gggaagtcca aattctaagg aaaaaaatgt ggtagtataa 300 

aaaggtatca ctgttgtaac ctatgaagat gtcagctatt cctttgaaat attttgcagg 360

aaaactca^'t acc atg aga att gca gtg att tgc ttt tg:; ctc cta ggc 409
Met Arg Ile Ala Val Ile Cys Phe Cys Leu Leu Gly
1 5 10

atc acc tgt gcc ata cca gtgagtacag ttgcatctta aagaaaattc 457
Ile Thr Cys Ala Ile Pro

15 



ctaaaaaraa <~taaattata taett.eeata t act. aw aaa acattct.tat aatctt. tctt 



Lys Gin 

30 

cgagaggtgc aagaaacgta tttgctgcga tcaaatgagt tcatatttgt aaagcaattt 685 

gaaagagtgc ctagcccaca gtaagtgcta cataagagtt tgttaaatga atctgcaaaa 745 

aaaaaaaaaa ttacaaaaag gtacctaagg gtccgggtga ctatatgctt ccatcaagac 805 

tagtgaagaa tggttgtttt ttccattcat ccctacattt ctttttttaa taatgataaa 865 

catgcaactt tttcgtag ctt tac aac aaa tac cca gat gct gtg gcc aca 916

Leu Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr
35 40

tgg cta aac cct gac cca tct cag aag cag aat ctc cta gcc cca cag 964
Trp Leu Asn Pro Asp Pro Ser Gln Lys Gln Asn Leu Leu Ala Pro Gln
45 50 55

gtatttttaa acttctcata attaaactac agtgatgaaa gatagccaca ctcaggccat 1024

ttgggctgct cagatgaatc ctgccctgcc tgctggcaaa catgtgctta ggacattgac 1084

tgatctgcca tgttggcttc tctctgtgtt aagccatcca cagatgaggc tgaaaaataa 1144

aaactgcttt ggattaaaaa ggttaacttt tgaataaaaa agctaggcat gtgtgatgcg 1204

cactaacacg tgccattcct tcttcag aat gct gtg tcc tct gaa gaa acc aat 1258

Asn Ala Val Ser Ser Glu Glu Thr Asn
60 65

gac ttt aaa caa gag gtaagttctc attttcaatc agaggcccat catgccttga 1313
Asp Phe Lys Gln Glu

70 

agagatgaaa gaaggcattg cctggattct cttctgatga aatttcatta gcaagttttc 1373 

cagctaattg gcagtctaaa acttgctcat aaataaaaca tgtatttact aaatatcaga 1433

aatactaggt ttcctcggat aacctaaaag ccatggtatg tactgtgaat gcaaagattc 1493

tgaaactaaa taaaaagaaa gatagtaaaa gactaatgtg ctataaaggc taagggaaaa 1553

Laaaaaccca ta:attaatt ttcccggcca tcttaatttt cag acc ctt cca agt 1608

Thr Leu Pro Ser
75

aag tcc aac gaa agc cat gac cac atg gat gat atg gat gat gaa gat 1656

Lys Ser Asn Glu Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp

80 85 90



110 115 120 

cat tct gat gaa tct gat gaa ctg gtc act gat ttt ccc acg gac ctg 1800 
His Ser Asp Glu Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu 
125 130 135 140 

cca gca acc gaa gtt ttc act cca gtt gtc ccc aca gta gac aca tat 1848 
Pro Ala Thr Glu Val Phe Thr Pro Val Val Pro Thr Val Asp Thr Tyr 
145 150 155 

gat ggc cga ggt gat agt gtg gtt tat gga ctg agg tca aaa tct aag 1896
Asp Gly Arg Gly Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys
160 165 170

aag ttt cgc aga cct gac atc cag gtaaatcctt taacagacac acctgatggt 1950
Lys Phe Arg Arg Pro Asp Ile Gln
175 180

tctgactagc gctcaagtct aggaaaccac agtttgcata ttcattcatt cattcatcca 2010

ttcattcatc cattcagcaa gaattcattc atattctact ttatgaccat tgaatacaaa 2070

tctttttctg cttggcggtt tttgtaagtc tacataattt ctctctagat ttgattctca 2130

aacacaattc tactttttga aatcctggat caaagtaaca tgctagtatt atttcagcca 2190

gatttagaca atttttagta taagatgacc taaaagctag agagtggaaa aggattacca 2250 

tattcccatc cctagccgtt catataatta ttcttcattt gtgccgtgat tcag tac 2307 

Tyr 



cct gat gct aca gac gag gac atc acc tca cac atg gaa agc gag gag 2355

Pro Asp Ala Thr Asp Glu Asp Ile Thr Ser His Met Glu Ser Glu Glu

185 190 195

ttg aat ggt gca tac aag gcc atc ccc gtt gcc cag gac ctg aac gcg 2403

Leu Asn Gly Ala Tyr Lys Ala Ile Pro Val Ala Gln Asp Leu Asn Ala
200 205 210 



cct t. ct gat tag gac age rat gag aaa gac aq: tat gaa acg agt cag 
Pro Ser Asp Trp Asp Ser Arg Gly Lys Asp Ser Tyr Glu Thr Ser Gln



: 4 5 1 



ctg gat gac cag agt gct gaa acc cac agc cac aag cag tcc aga tta 2499

Leu Asp Asp Gln Ser Ala Glu Thr His Ser His Lys Gln Ser Arg Leu

230 235 240 245

tat aag cgg aaa gcc aat gat gag agc aat gag cat tcc gat gtg att 2547

Tyr Lys Arg Lys Ala Asn Asp Glu Ser Asn Glu His Ser Asp Val Ile

250 255 260

Phe His Ser His Glu Asp Met Leu Val Val Asp Pro Lys Ser Lys Glu
280 285 290

gaa gat aaa cac ctg aaa ttt cgt att tct cat gaa tta gat agt gca 2691

Glu Asp Lys His Leu Lys Phe Arg Ile Ser His Glu Leu Asp Ser Ala

295 300 305

tct tct gag gtc aat taa aaggagaaaa aatacaattt ctcactttgc 2739
Ser Ser Glu Val Asn
310

atttagtcaa aagaaaaaat gctttatagc aaaatgaaag agaacatgaa atgcttcttt 2799 

ctcagtttat tggttgaatg tgtatctatt tgagtctgga aataactaat gtgtttgata 2859 

attagtttag tttgtggctt catggaaact ccctgtaaac aaaagcttca gggttatgtc 2919 

tatgttcatt ctatagaaga aatgcaaact atcactgtat tttaatattt gttattctct 2979 

catgaataga aatttatgta gaagcaaaca aaatactttt acccacttaa aaagagaata 3039 

taacatttta tgtcactata atcttttgtt ttttaagtta gtgtatattt tgttgtgatt 3099 

atcttttgtg gtgtgaataa atcttttatc ttgaatgtaa taag 3143 



<210> 24

<211> 314 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> (1) . . (16) 

<223> Potential

<220>

<221> Gene

<222> (1) . . (314)

<223> Osteopontin

<220>

<221> CHAIN

<222> (17) . . (314)

<223> Osteopontin

<220>

<221> SITE

<222> (159) . . (161)

<223> Cell attachment site

<220>

<221> CARBOHYD

<222> (106) . . (106)

<223> N-linked (GLCNAC...) (Potential)

<220>

<221> VARSPLIC

<222> (31) . . (57)

<223> Missing (In Isoform C)

<220>

<221> VARSPLIC

<222> (58) . . (71)

<223> Missing (In Isoform B)

<400> 24



Met Arg Ile Ala Val Ile Cys Phe Cys Leu Leu Gly Ile Thr Cys Ala
1 5 10 15

Ile Pro Val Lys Gln Ala Asp Ser Gly Ser Ser Glu Glu Lys Gln Leu
20 25 30

Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr Trp Leu Asn Pro Asp Pro
35 40 45

Ser Gln Lys Gln Asn Leu Leu Ala Pro Gln Asn Ala Val Ser Ser Glu
50 55 60

Glu Thr Asn Asp Phe Lys Gln Glu Thr Leu Pro Ser Lys Ser Asn Glu
65 70 75 80

Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp Asp Asp Asp His
85 90 95

Val Asp Ser Gln Asp Ser Ile Asp Ser Asn Asp Ser Asp Asp Val Asp
100 105 110

Asp Thr Asp Asp Ser His Gln Ser Asp Glu Ser His His Ser Asp Glu
115 120 125

Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu Pro Ala Thr Glu
130 135 140

Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys Lys Phe Arg Arg
165 170 175

Pro Asp Ile Gln Tyr Pro Asp Ala Thr Asp Glu Asp Ile Thr Ser His
180 185 190

Met Glu Ser Glu Glu Leu Asn Gly Ala Tyr Lys Ala Ile Pro Val Ala
195 200 205

Gln Asp Leu Asn Ala Pro Ser Asp Trp Asp Ser Arg Gly Lys Asp Ser
210 215 220

Tyr Glu Thr Ser Gln Leu Asp Asp Gln Ser Ala Glu Thr His Ser His
225 230 235 240

Lys Gln Ser Arg Leu Tyr Lys Arg Lys Ala Asn Asp Glu Ser Asn Glu
245 250 255

His Ser Asp Val Ile Asp Ser Gln Glu Leu Ser Lys Val Ser Arg Glu
260 265 270

Phe His Ser His Glu Phe His Ser His Glu Asp Met Leu Val Val Asp
275 280 285

Pro Lys Ser Lys Glu Glu Asp Lys His Leu Lys Phe Arg Ile Ser His
290 295 300

Glu Leu Asp Ser Ala Ser Ser Glu Val Asn
305 310



<210> 25

<211> 259

<212> PRT

<213> Homo sapiens

<220>

<221> Gene

<222> (1) . . (259)

<223> Carbonic anhydrase domain of human carbonic anhydrase III



Glu Leu Phe Pro Asn Ala Lys Gly Glu Asn Gln Ser Pro Ile Glu Leu
20 25 30

His Thr Lys Asp Ile Arg His Asp Pro Ser Leu Gln Pro Trp Ser Val
35 40 45

Ser Tyr Asp Gly Gly Ser Ala Lys Thr Ile Leu Asn Asn Gly Lys Thr
50 55 60

Cys Arg Val Val Phe Asp Asp Thr Tyr Asp Arg Ser Met Leu Arg Gly
65 70 75 80

Gly Pro Leu Pro Gly Pro Tyr Arg Leu Arg Gln Phe His Leu His Trp
85 90 95

Gly Ser Ser Asp Asp His Gly Ser Glu His Thr Val Asp Gly Val Lys
100 105 110

Tyr Ala Ala Glu Leu His Leu Val His Trp Asn Pro Lys Tyr Asn Thr
115 120 125

Phe Lys Glu Ala Leu Lys Gln Arg Asp Gly Ile Ala Val Ile Gly Ile
130 135 140

Phe Leu Lys Ile Gly His Glu Asn Gly Glu Phe Gln Ile Phe Leu Asp
145 150 155 160

Ala Leu Asp Lys Ile Lys Thr Lys Gly Lys Glu Ala Pro Phe Thr Lys
165 170 175

Phe Asp Pro Ser Cys Leu Phe Pro Ala Cys Arg Asp Tyr Trp Thr Tyr
180 185 190

Gln Gly Ser Phe Thr Thr Pro Pro Cys Glu Glu Cys Ile Val Trp Leu
195 200 205

Leu Leu Lys Glu Pro Met Thr Val Ser Ser Asp Gln Met Ala Lys Leu
210 215 220



245 250 255 



Ser Phe Lys 



<210> 26 

<211> 260

<212> PRT 

<213> Homo sapiens 

<220> 

<221> Gene 

<222> (1) . . (260) 

<223> Carbonic anhydrase domain of human carbonic anhydrase I 



<400> 26 

Ala Ser Pro Asp Trp Gly Tyr Asp Asp Lys Asn Gly Pro Glu Gln Trp
1 5 10 15

Ser Lys Leu Tyr Pro Ile Ala Asn Gly Asn Asn Gln Ser Pro Val Asp
20 25 30

Ile Lys Thr Ser Glu Thr Lys His Asp Thr Ser Leu Lys Pro Ile Ser
35 40 45

Val Ser Tyr Asn Pro Ala Thr Ala Lys Glu Ile Ile Asn Val Gly His
50 55 60

Ser Phe His Val Asn Phe Glu Asp Asn Asp Asn Arg Ser Val Leu Lys
65 70 75 80

Gly Gly Pro Phe Ser Asp Ser Tyr Arg Leu Phe Gln Phe His Phe His
85 90 95

Trp Gly Ser Thr Asn Glu His Gly Ser Glu His Thr Val Asp Gly Val
100 105 110

Lys Tyr Ser Ala Glu Leu His Val Ala His Trp Asn Ser Ala Lys Tyr
115 120 125

145 150 155 160

Leu Asp Ala Leu Gln Ala Ile Lys Thr Lys Gly Lys Arg Ala Pro Phe
165 170 175

Thr Asn Phe Asp Pro Ser Thr Leu Leu Pro Ser Ser Leu Asp Phe Trp
180 185 190

Thr Tyr Pro Gly Ser Leu Thr His Pro Pro Leu Tyr Glu Ser Val Thr
195 200 205

Trp Ile Ile Cys Lys Glu Ser Ile Ser Val Ser Ser Glu Gln Leu Ala
210 215 220

Gln Phe Arg Ser Leu Leu Ser Asn Val Glu Gly Asp Asn Ala Val Pro
225 230 235 240

Met Gln His Asn Asn Arg Pro Thr Gln Pro Leu Lys Gly Arg Thr Val
245 250 255 



Arg Ala Ser Phe 

260 



<210> 27 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> Gene 

<222> (1) . . (337) 

<223> Carbonic anhydrase domain of human carbonic anhydrase VIX



Met Leu Phe Ser Ala Leu Leu Leu Glu Val Ile Trp Ile Leu Ala Ala
1 5 10 15

Asp Gly Gly Gln His Trp Thr Tyr Glu Gly Pro His Gly Gln Asp His
20 25 30

50 55 60

Gln Pro His Gly Tyr Asp Gln Pro Gly Thr Glu Pro Leu Asp Leu His
65 70 75 80

Asn Asn Gly His Thr Val Gln Leu Ser Leu Pro Ser Thr Leu Tyr Leu
85 90 95

Gly Gly Leu Pro Arg Lys Tyr Val Ala Ala Gln Leu His Leu His Trp
100 105 110

Gly Gln Lys Gly Ser Pro Gly Gly Ser Glu His Gln Ile Asn Ser Glu
115 120 125

Ala Thr Phe Ala Glu Leu His Ile Val His Tyr Asp Ser Asp Ser Tyr
130 135 140

Asp Ser Leu Ser Glu Ala Ala Glu Arg Pro Gln Gly Leu Ala Val Leu
145 150 155 160

Gly Ile Leu Ile Glu Val Gly Glu Thr Lys Asn Ile Ala Tyr Glu His
165 170 175

Ile Leu Ser His Leu His Glu Val Arg His Lys Asp Gln Lys Thr Ser
180 185 190

Val Pro Pro Phe Asn Leu Arg Glu Leu Leu Pro Lys Gln Leu Gly Gln
195 200 205

Tyr Phe Arg Tyr Asn Gly Ser Leu Thr Thr Pro Pro Cys Tyr Gln Ser
210 215 220

Val Leu Trp Thr Val Phe Tyr Arg Arg Ser Gln Ile Ser Met Glu Gln
225 230 235 240

Leu Glu Lys Leu Gln Gly Thr Leu Phe Ser Thr Glu Glu Glu Pro Ser
245 250 255

Glu Met Leu Ser Leu Gly Val Gly Ile Leu Val Gly Cys Leu Cys Leu
290 295 300

Leu Leu Ala Val Tyr Phe Ile Ala Arg Lys Ile Arg Lys Lys Arg Leu
305 310 315 320

Glu Asn Arg Lys Ser Val Val Phe Thr Ser Ala Gln Ala Thr Thr Glu
325 330 335

Ala



<210> 28 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220>

<223> Primer 



<400> 28 

cagcagttgg atggaagagg ac 22 

<210> 29 

<211> 22

<212> DNA 

<213> Artificial sequence 




Primer 



c.ict.gagat t ctggcactat tc 




<212> DNA

<213> Artificial sequence

<220>

<223> Primer



<210> 31 
<211> 21
<212> DNA

<213> Artificial sequence

<220>
<223>

<400> 31

ttgactggct caggagtata g 



<210> 32

<211> 21
<212> DNA

<213> Artificial sequence

<220>

<223> Primer

<400> 32

ctgataatga gggctcccaa c 



<210> 33 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer 



<400> 33

ctctgcactt cctggtaaaa ctct 



<212> DNA

<213> Artificial sequence

<220>

<223> Primer

<400> 34

cagcagttgg atggaagagg ac 




<220>

<223> Primer

<400> 35

ctctgcactt cctggtaaaa ctct 



